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Abstract 



< 

, This paper constructs translation invariant operators on L^(R'^), 

which are Lipschitz continuous to the action of diffeomorphisms. A 
scattering propagator is a path ordered product of non-hnear and 
^ . non-commuting operators, each of which computes the modulus of a 

J> I wavelet transform. A local integration defines a windowed scattering 

■ transform, which is proved to be Lipschitz continuous to the action of 

. diffeomorphisms. As the window size increases, it converges to a 

CN I wavelet scattering transform which is translation invariant. Scattering 

coefficients also provide representations of stationary processes. Ex- 
pected values depend upon high order moments and can discriminate 
processes having the same power spectrum. Scattering operators are 
extended on L^(G), where G is a compact Lie group, and are invari- 
ant under the action of G. Combining a scattering on L^(M'^) and on 
r> ! Li'^{SO{d)) defines a translation and rotation invariant scattering on 

C^- L2(R'^). 



1 Introduction 

Symmetry and invariants, which play a major role in physics |6], are mak- 
ing their way into signal information processing. The information content of 
sounds or images is typically not affected under the action of finite groups 

*This work was supported by the ANR-lO-BLAN-0126 grant. 
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such as translations or rotations, and it is stable to the action of small diffeo- 
morphisms that deform signals [21] . This motivates the study of translation- 
invariant representations of L^(]R'^) functions, which are Lipschitz continuous 
to the action of diffeomorphisms, and which keep high-frequency information 
to discriminate different types signals. Invariance to the action of compact 
Lie groups and rotations are then studied. 

We first concentrate on translation invariance. Let Lcf{x) = f{x — c) 
denote the translation of / G L^(]R'^) by c G M.'^. An operator $ from L^(M'^) 
to a Hilbert space "H is translation-invariant if ^{Lcf) = $(/) for all / e 
L^(M'^) and c G M'^. Canonical translation invariant operators satisfy $(/) = 
Laf for some a G M'^ which depends upon / [TJ]. The modulus of the Fourier 
transform of / is an example of non-canonical translation invariant operator. 
However, these translation invariant operators are not Lipschitz continuous 
to the action of diffeomorphisms. Instabilities to deformations are well-known 
to appear at high frequencies [10]. The major difficulty is to maintain the 
Lipschitz continuity over high frequencies. 

To preserve stability in L^(M'^) we want $ to be nonexpansive: 

V(/, h) G L\R')' , ||<|.(/) - <\\f-h\\. 

It is then sufficient to verify its Lipschitz continuity relatively to the action 
of small diffeomorphisms close to translations. Such a diffeomorphism trans- 
forms X G M'' into X — t{x), where t{x) G is the displacement field. Let 
Lrfi^) = f{x — t{x)) denote the action of the diffeomorphism 1 — r on /. 
Lipschitz stability means that ||$(/) — is bounded by the "size" of 

the diffeomorphism and hence by the distance between the 1 — r and 1, up to 
a multiplicative constant multiplied by ||/||. Let |t(x)| denote the Euchdean 
norm in Mf^, |Vr(a;)| the sup norm of the matrix Vt{x), and \Ht{x)\ the 
sup norm of the Hessian tensor. The weak topology on diffeomorphisms 
defines a distance between 1 — r and 1, over any compact subset Q of M.'^, 
by: 

d^{l, 1 — r) = sup |t(x)| + sup |Vr(x)| + sup \Ht{x)\. (1) 

x&Q x£f2 xGQ 

A translation invariant operator $ is said to be Lipschitz continuous to the 
action of diffeomorphisms if for any compact i7 C M'^ there exists C such 
that for all / G L2(R"') supported in and all r G C2(M^) 

||$(/) - HLJ)\\n < C 11/11 (sup |Vr(x)| + sup \Ht{x)\) . (2) 
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Since $ is translation invariant, the Lipschitz upper bound does not depend 
upon the maximum translation amplitude sup^ \ t{x)\ of the diffeomorphism 
metric ([1]). The Lipschitz continuity ([2]) implies that $ is invariant to global 
translations, but it is much stronger. $ is almost invariant to "local transla- 
tions" by t{x), up to the first and second order deformation terms. 

High frequency instabilities to deformations can be avoided by grouping 
frequencies into dyadic packets in M'^, with a wavelet transform. However, a 
wavelet transform is not translation invariant. A translation invariant oper- 
ator is constructed with a scattering procedure along multiple paths, which 
preserves the Lipschitz stability of wavelets to the action of diffeomorphisms. 
A scattering propagator is first defined as a path ordered product of non- 
linear and non-commuting operators, each of which computes the modulus of 
a wavelet transform [13] . This cascade of convolutions and modulus can also 
be interpreted as a convolutional neural-network pLlJ- A windowed scattering 
transform is a nonexpansive operator which locally integrates the scattering 
propagator output. For appropriate wavelets, the main theorem in Section 
[2]prove that a windowed scattering preserves the norm: ||$(/)||^ = ||/|| for 
all / G L^(R"'), and it is Lipschitz continuous to diffeomorphisms. 

When the window size increases, windowed scattering transforms con- 
verge to a translation invariant scattering transform, defined on a path set 
Voo which is not countable. Section [3] introduces a measure n and a metric on 
Voo, and proves that scattering transforms of functions in L^(]R'^) belong to 
L^(Poo, c^A*)- A scattering transform has striking similarities with a Fourier 
transform modulus, but a different behavior relatively to the action of dif- 
feomorphisms. Numerical examples are shown. An open conjecture remains 
on conditions for strong convergence in L^(R'^). 

The representation of stationary processes with the Fourier power spec- 
trum results from the translation invariance of the Fourier modulus. Simi- 
larly, Section H] defines an expected scattering transform which maps station- 
ary processes to an F space. Scattering coefficients depend upon high order 
moments of stationary processes, and can thus discriminate processes having 
same second-order moments. As opposed to the Fourier spectrum, a scat- 
tering representation is Lipschitz continuous to random deformations up to 
a log term. For large classes of ergodic processes, it is numerically observed 
that the scattering transform of a single realization provides a mean-square 
consistent estimator of the expected scattering transform. 

Section [5] extends scattering operators to build invariants to actions of 
compact Lie groups G. The left action of g E G on f E L^(G) is denoted 
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Lgfi''^) = fid'^"^)- An operator $ on L^(G') is invariant to the action of 
G if ^{Lgf) = $(/) for all / E L^(G) and all g E G. Invariant scattering 
operators are constructed on L^(G') with a scattering propagator which iter- 
ates on a wavelet transform defined on L^(G'), and a modulus operator which 
removes complex phases. A translation and rotation invariant scattering on 
L^(M'^) is obtained by combining a scattering on L^(M'^) and a scattering on 
L^iSOid)). 

A software package is available at www.cmap.polytechnique.fr/scattering, 
to reproduce numerical experiments. Applications to audio and image clas- 
sification can be found in [H [Sj HJ [18] . 

Notations: ||r||oo = sup^gjj'* II^^IU = sup(^^„)g]j2d |r(x) - r{u)\, 

1 1 Vr I loo = sup3,gRd |Vr(x)| and ||ifr||oo = sup^gj^d \Ht{x)\ where \Ht{x)\ is 
the norm of the Hessian tensor. The inner product of (x, y) G M^'^ is x-y. The 
norm of / in a Hilbert space is ||/|| and in L^(M"'): = J \ f{x)\'^dx. The 
norm in L^{R'^) is ||/||i = J\f{x)\dx. We denote /(w) = / /(x) e"^^"^ do; 
the Fourier transform of /. We denote g o /(x) = f{gx) the action of a 
group element g E G. An operator R parametrized by p is denoted R\p] and 
R[Q] = {R[p]}p(zn. The sup norm of a linear operator A in L^(M'^) is denoted 
II A II and the commutator of two operators is [A , i?] = AB — BA. 

2 Finite Path Scattering 

To avoid high frequency instabilities under the action of diffeomorphisms, 
Section 12.21 introduces scattering operators, which iteratively apply wavelet 
transforms and remove complex phases with a modulus. Section 12.31 proves 
that a scattering is nonexpansive and preserves L^(M'') norms. Translation 
invariance and Lipschitz continuity to deformations are proved in Section [231 
and [231 

2.1 From Fourier to Littlewood-Paley Wavelets 

The Fourier transform modulus $(/) = |/| is translation- invariant. Indeed 
for c G W^, the translation Lcf{x) = f{x — c) satisfies Lcf{uj) = e~*'^''^/(a;) 
and hence |i^c/| = |/|- However, deformations lead to well-known instabili- 
ties at high frequencies p2j. This is illustrated with a small scaling operator, 
Lrf{x) = f{x — t(x)) = /((I — s)x), for r(x) = sx and || Vr||oo = \s\ < 1. If 
f{x) = e*^'^' 6{x) then scaling by 1 — s translates the central frequency ^ to 
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(1 — s)^. If 6^ is regular with a fast decay then 

lllQl- mil 1^1 ll^ll = l|Vr|Ule| 11/11 . (3) 

Since |^| can be arbitrarily large, $(/) = |/| does not satisfy the Lipschitz 
continuity condition when scaling high frequencies. The frequency dis- 
placement from ^ to (1 — s)^ has a small impact if sinusoidal waves are 
replaced by localized functions having a Fourier support which is wider at 
high frequencies. This is achieved by a wavelet transform [TJ [H], whose 
properties are briefly reviewed in this section. 

A wavelet transform is constructed by dilating a wavelet ip ^ L^(R'^) 
with a scale sequence {a^j^z for a > 1. For image processing, usually 
a = 2 [21 H]. Audio processing requires a better frequency resolution with 
typically a < 2^/^ [1]. To simplify notations, we normalize a = 2, with no 
loss of generality. Dilated wavelets are also rotated with elements of a finite 
rotation group G, which also includes the reflection —1 with respect to 0: 
—Ix = —X. If d is even then G is a subgroup of SO{d), and if d is odd then 
G is a finite subgroup of 0{d). A mother wavelet is dilated by and 
rotated by r G G 

ij2^rix) = 2'''i;i2^r-'x). (4) 

Its Fourier transform is ?/'2j>.(c<;) = ilj{2^^r^^uj). A scattering transform is 
computed with wavelets that can be written 

iIj{x) = e*"''^ 9{x) and hence ip{u) = 9{uj - r]) , (5) 

where 0{u) is a real function concentrated in a low frequency ball centered 
at u = 0, whose radius is of the order of vr. It results that ip{oj) is real 
and concentrated in a frequency ball of same radius, but centered at u = t]. 
To simplify notations we denote A = 2^r G 2^ x G, with |A| = 2^. After 
dilation and rotation, ipxi^) = 6{X^^uj — if) covers a ball centered at Xi] with 
a radius proportional to |A| = 2K The index A thus specifies the frequency 
localization and spread of ipx. 

As opposed to wavelet bases, a Littlewood-Paley wavelet transform [71 [H] 
is a redundant representation which computes convolution values at all x G 
M'^, without subsamphng: 

yxeW" , W[\]f{x) = f^iJx{x)= [ f{u)Mx-u)du . (6) 
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Its Fourier transform is 

W[X\fi^) = /M^a(c^) = f{co)ij{\-'u) . 

If / is real then f{-uj) = f*{io) and if -^{u) is real then W[-X]f = W[X]f*. 
Let denote the quotient of G with {— 1 , 1}, where two rotations r and 
— r are equivalent. It is sufficient to compute H^[2-^r]/ for "positive" rotations 
r G G~^. If / is complex then iy[2-'r]/ must be computed for all r £ G = 
G+ X {-1,1}. 

A wavelet transform at a scale 2-^ only keeps wavelets of frequencies 
2^ > 2'"^ . The low frequencies which are not covered by these wavelets 
are provided by an averaging over a spatial domain proportional to 2"^: 

Ajf = f^ 02^ with 027 (a;) = 2-'^-^0(2-^x) . (7) 

If / is real then the wavelet transform Wjf = |v4j/, ^W^[A]/j | is 
indexed by Aj = {A = 2^r : r G G^ , 2^ > 2"'^}. Its norm is 

If J = oo then Woof = {w[X]f}xeA^ with Aoo = 2^ x G+. Its norm is 
llW^oo/P = XIagAoo II^[A]/||^- For complex-valued functions /, all rotations 
in G are included by defining Wjf = IAj/, (w[\]A ^^^^ \ and Woof = 

-A6Aj 

< W^[A]/} AsAoo . The following proposition gives a standard Littlewood-Paley 
condition [7] so that Wj is unitary. 

Proposition 2.1 For any J E "Z or J = oo, Wj is unitary in the spaces of 
real-valued or complex-valued functions in L^(]R'^) if and only if for almost 
all 00 eR'^ 

oo 

P E J2\^{2-^^-'uj)\' = 1 and |0MP = /3 J] ^|^(2-^r-MP, 

j=—oo r£G j=—oo r^G 

(9) 

where (3 = 1 for complex functions and (3 = 1/2 for real functions. 
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Proof: If / is complex, (3 = 1 and one can verify that is equivalent to 
VJe Z , 10(2-^0;) |2+ J2 \^{2-'r-^uj)\^ = 1 . (10) 

j>-J,r&G 

Since W[2^r]f{u) = f{uj)'ip23r{^)., multiplying (fTOj) by \ f{uj)\'^, and applying 
the Plancherel formula proves that HVrj/p = For J = oo the same 

result is obtained by letting J go to oo. 

Conversely, if = then ffTOj) is satisfied for almost all u. 

Otherwise, one can construct a function / 7^ where / has a support in the 
domain of u where ffTOl) is not valid. With the Plancherel formula we verify 
that llW/'j/lP 7^ II/IP? which contradicts the hypothesis. 

If / is real then \f{u)\ = \fi-uj)\ so ||iy[2V]/|| = \\W[-2^r]f\\. Hence 
remains the same if r is restricted to and xjj is multiplied by a/2, 
which yields condition ([9]) with /3 = 1/2. □ 

In all the following, ip is a real function which satisfies the unitary con- 
dition ([9]). It implies that ip^O) = J ip{x) dx = and |0(rci;)| = |</'(u;)| for 
all r & G. We choose 0(w) to be real and symmetric so that is also real 
and symmetric and 0(rx) = for all r E G. We also suppose that and 
ip are twice differentiable and that their decay as well as the decay of their 
partial derivatives of order 1 and 2 is 0((1 + \x\)^'^^'^). 

A change of variable in the wavelet transform integral shows that if / is 
scaled and rotated, 2'-g o f = f{2}gx) with 2^g G 2^ x G, then the wavelet 
transform is scaled and rotated according to: 

W[X](2}gof)=2'goW[2-'g\]f. (11) 

Since is invariant to rotations in G we verify that A j commutes with 
rotations in G: Aj{g o f) = g o Ajf for all g E G. 

In dimension d = 1, G = {—1,1}. According to ([5]), to build a complex 
wavelet ip concentrated on a single frequency band, we set ip{uj) = for 
a; < 0. Following (|9]), Wj is unitary if and only if 



(3 m^"'m' = 1 and \4>{oo)\' = /3 ^ l^(2->|)P • (12) 
jez j=-oo 

If is a real wavelet which generates a dyadic orthonormal basis of L^(M) 
[H] then -0 = 2'?/'l^>o satisfies ([9]). Numerical examples in the paper are 
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computed with a complex wavelet ip calculated from a cubic-spline orthogonal 
Battle-Lemarie wavelet ip 

In any dimension d > 2, ip E L^(]R'^) can be defined as a separable product 
in frequency polar coordinates u = \u\ri, with t] in the unit sphere S'^ of M^: 

V(|a;U) G X S'^ , tP{\u\r]) = tij{\u;\) j{r]) . 

The one-dimensional function V'(l'^l) is chosen to satisfy (|T2l) . The Littlewood- 
Paley condition ([9]) is then equivalent to 

Vr/eS'^ , ^|7(r-ir/)r = l. 
2.2 Path Ordered Scattering 

Convolutions with wavelets defines operators which are Lipschitz continuous 
to the action of diffeomorphisms, because wavelets are regular and localized 
functions. However, a wavelet transform is not invariant to translations, and 
W^[A]/ = / ^ V'a translates when / is translated. The main difficulty is to 
compute translation invariant coefficients, which remain stable to the ac- 
tion of diffeomorphisms, and retain high frequency information provided by 
wavelets. A scattering operator computes such a translation invariant rep- 
resentation. We first explain how to build translation invariant coefficients 
from a wavelet transform, while maintaining stability to the action of diffeo- 
morphisms. Scattering operators are then defined, and their main properties 
are summarized. 

If U[X\ is an operator defined on L^(M'^), not necessarily linear but which 
commutes with translations, then J U[X]f{x) dx is translation invariant, if 
finite. W^[A]/ = f ipx commutes with translations but J W[X]f{x) dx = 
because / ip{x) dx = 0. More generally, one can verify that any linear trans- 
formation of py[A]/, which is translation invariant, is necessarily zero. To get 
a non-zero invariant, we set U[X\f = M[X\W[X\f where M[A] is a non-linear 
"demodulation" which maps 14^ [A]/ to a lower frequency function having 
a non-zero integral. The choice of M[A] must also preserve the Lipschitz 
continuity to diffeomorphism actions. 

If iplx) = e^^'^9{x) then il)\{x) = e'^^^'^ 6x{x), and hence 

W^[A]/(x) =e^^'^-^ (^/^*^A(a;)) with f\x) = e-'^" '' f{x) . (13) 
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The convolution f^*9x is a low-frequency filtering because 9x{ijj) = 9{\^^u) 
covers a frequency ball centered at w = 0, of radius proportional to |A|. A 
non-zero invariant can thus be obtained by canceling the modulation term 
^iXrj-x M[X\. A simple example is: 

M[X\h{x) = e-*^''-^e-^*('*(^''))/i(x) (14) 

where ^{h{Xri)) is the complex phase of h{Xr]). This non-linear phase reg- 
istration guarantees that M[A] commutes with translations. It results from 
(1T3|) that / M[\]W[\]f{x) dx = |/(Ar/)| |^(0)|. It recovers the Fourier modu- 
lus representation, which is translation invariant but not Lipschitz continuous 
to diffeomorphisms as shown in ([3]). Indeed, the demodulation operator M[A] 
in ( I14p commutes with translations but does not commute with the action of 
diffeomorphisms, and in particular with dilations. The commutator norm of 
M[A] with a dilation is equal to 2, even for arbitrarily small dilations, which 
explains the resulting instabilities. 

Lipschitz continuity under the action of diffeomorphisms is preserved if 
M[A] commutes with the action of diffeomorphisms. For L^(M'^) stability, 
we also impose that M[A] is nonexpansive. One can prove [1] that M[A] is 
then necessarily a pointwise operator, which means that M[X\h{x) only de- 
pends on the value of h at x. We further impose that ||M[A]/i|| = \\h\\ for all 
h G L^(]R'^), which then implies that |M[A]/i| = \h\. The most regular func- 
tions are obtained with M[X\h = \h\, which eliminates all phase variations. 
We derive from (ITS]) that this modulus maps Vr[A]/ into a lower frequency 
envelop: 

M[A]W^[A]/=|W^[A]/| = |/^^^^a| . 

Lower frequencies created by a modulus result from interferences. For exam- 
ple, if f{x) = cos(^i ■ x) + a cos(^2 ■ x) where i^i and ^2 are in the frequency 
band covered by ipx then \ fikipx{x)\ = 2~-^\ilJxiCi) + «'?/'a(^2) e*^^^"^^^'^'| oscil- 
lates at the interference frequency |^2 ~ ^il, which is smaller than |^i| and 

The integration J U[X\f{x) dx = J \ f 'kil!x{x) \ dx is translation invariant 
but it removes all the high frequencies of \f -k iIjx{x)\. To recover these high 
frequencies, a scattering also computes the wavelet coefficients of each U[X]f: 
{U[X]f iciljx'}x'- Translation invariant coefficients are again obtained with a 
modulus U[X']U[X]f = \U[X]fi.^x'\ and an integration / U[X']U[X]f{x) dx. If 
f{x) = cos(,^i ■x) + a cos(,^2 ■ x) with a < 1, |,^2 — ■Cil ^ l-^l and |^2 — ■Cil i^i the 
support of ipx' then f/[A']f/[A]/ is proportional to a\ipx{^i)\ \i^x'{\(,2 — 
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The second wavelet ipy captures the interferences created by the modulus, 
between the frequency components of / in the support of We now intro- 
duce the scattering propagator, which extends these decompositions. 

Definition 2.2 An ordered sequence p = (Ai, A^) with Xk G A^o = 2^ x 
is called a path. The empty path is denoted p = ^. Let f/[A]/ = \f -^ipx\ 
for f G L^(M'^). A scattering propagator is a path ordered product of non- 
commutative operators defined by 

U\p] = U[X^]...U[X2]U[X,] , (15) 

with f/[0] = Id. 

The operator U[p] is well defined on V{R'^) because ||f/[A]/|| < ||^/'a||i||/|| 
for all A G Aoo- The scattering propagator is a cascade of convolutions and 
modulus: 

U[p]f = I |/*V^AiKV^A2|---KV^A„| • (16) 

Each U[X] filters the frequency component in the band covered by ipx, and 
maps it to lower frequencies with the modulus. The index sequence p = 
(Ai,...,Am) is thus a frequency path variable. The scaling and rotation by 
2ig g 2^ X G of a path p is written 2'-gp = {2^gXi, ...,2^gXm)- The concate- 
nation of two paths is denoted p + p' = (Ai, A^, A'^, A^,), in particular 
p + X = (Ai, Xm, A). It results from ( ITB]) that 

U[p + p'] = U[p']U[p] . (17) 

Section [^?T] explains that if / is complex valued then its wavelet transform 
is W^f = {W[X]f} A6A^ whereas if / is real then W^of = {W[X]f}xeA^. 

-AGAoo 

If / is complex then at the next iteration U[Xi]f = \W[Xi]f \ is real so next 
stage wavelet transforms are computed only for A^ G Aqo. The scattering 
propagator of a complex function is thus defined over "positive" paths p = 
(Ai, A2, Am) G A™ and "negative" paths denoted —p = (— Ai, A2, Am). 
This is analogous to the positive and negative frequencies of a Fourier trans- 
form. If / is real then W[-Xi]f = W[Xi]f* so U[-Xi]f = U[Xi]f and hence 
U[—p]f = U[p]f. To simplify explanations, all results are proved on real 
functions with scattering propagators restricted to positive paths. These 
results apply to complex functions by including negative paths. 
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Definition 2.3 Let V^o be the set of all finite paths. The scattering trans- 
form of f E L^(]R'^) is defined for any p G Poo by 

Sf{p) = — J U[p]f{x) dx with Hp = J U[p]5{x) dx . (18) 

A scattering is a translation-invariant operator which transforms / G 
L^(]R'^) into a function of the frequency path variable p. The normahzation 
factor fip results from a path measure introduced in Section [31 Conditions 
are given so that /Xp does not vanish. This transform is then well-defined for 
any / G L^(M'^) and any p of finite length m. Indeed \\ipx\\i = W'ipWi so ( IT6|) 
implies that ||t/[p]/||i < ||/||i IIV'H]"- We shall see that a scattering transform 
has similarities with Fourier transform modulus, where the path p plays the 
role of a frequency variable. However, as opposed to a Fourier modulus, a 
scattering transform is stable to the action of diffeomorphisms, because it is 
computed by iterating on wavelet transforms and modulus operators, which 
are stable. For complex-valued functions, Sf is also defined on negative 
paths, and Sf{—p) = Sf{p) if / is real. 

If p 7^ then Sf{p) is non-linear but it preserves amplitude factors: 

V/iGM , S{fxf)ip) = \fx\Sfip). (19) 

A scattering has similar scaling and rotation covariance properties as a 
Fourier transform. If / is scaled and rotated, 2'-g o /(x) = f{2'-gx), then 
f fTTj) implies that U[X]{2'-g o /) = 2''g o U[2~'-gX]f and cascading this result 
shows that 

VpGPoo , U[p]i2'gof)=2'goU[2-'gp]f. (20) 
Inserting this result in the definition (I18p proves that 

S{2'gof){p) = 2-''^Sf{2-^gp) . (21) 

Rotating / thus rotates identically its scattering, whereas if / is scaled by 2' 
then the frequency paths p is scaled by 2~K The extension of the scattering 
transform in L^(M'^) is done as a limit of windowed scattering transforms, 
that we now introduce. 

Definition 2.4 Let J E 1^ and Vj be the set of finite paths p = (Ai, A^) 
with Afc G A J and hence |Afc| = 2^'= > 2^"^. A windowed scattering transform 
is defined for all p eVj by 

Sj\p]fi^) = U\p]f^<p24x) = [ U\p]f{u)^2-'{x~u)du . (22) 
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The convolution with 4'2j{x) = 2 '^'^0(2 ■^x) locahzes the scattering trans- 
form over spatial domains of size proportional to 2'^: 

Sj[p]f{x) = I |/*V'AihV'A2|---|*V'A™|*</'2^(x) . 

It defines an infinite family of functions indexed by Vj, denoted 

Sj[Vj]f = {Sj[p]fUrj . 

For complex- valued functions, negative paths are also included in Vj, and 
Sj[-p]f = Sj[p\f if / is real. 

Section [273] proves that for appropriate wavelets. ||/||^ = Xlpg-P/ 
However, the signal energy is mostly concentrated on a much smaller set of 
frequency-decreasing paths p = {\k)k<m for which |Afe+i| < |Afc|. Indeed, 
the propagator U[X] progressively pushes the energy towards lower frequen- 
cies. The main theorem of Section 12.51 proves that a windowed scattering is 
Lipschitz continuous to the action of diffeomorphisms. 

Since is continuous at 0, if / G L^(M'^) then its windowed scattering 
transform converges pointwise to its scattering transform when the scale 2'^ 
goes to oo: 

Vx G M^ hm 2''' Sj[p]f{x) = 0(0) [ U[p]f{u) du = 0(0) ^ip'S{p) . (23) 

J—^oo J 

However, when J increases, the path set Vj also increases. Section [3] shows 
that {Vj}j(zz defines a multiresolution path approximation of a much larger 
set Voo including paths of infinite length. This path set is not countable as 
opposed to each Vj, and Section |3] introduces a measure fi and a metric on 

Section [3^ extends the scattering transform Sf{p) to all / G L^(R'^) and 
to all p G Voo, and proves that Sf G Li'^{Voo,dfi). A sufficient condition is 
given to guarantee a strong convergence of Sjf to Sf, and it is conjectured 
that it is valid on L^(M°'). Numerical examples illustrate this convergence 
and show that a scattering transform has strong similarities with a Fourier 
transforms modulus, when mapping the path p to a frequency variable cu G 

2.3 Scattering Propagation and Norm Preservation 

We prove that a windowed scattering Sj is nonexpansive, and preserves the 
L^(M'^) norm. Family of operators indexed by a path set Q are written 
Sj[n] = {Sj[p]}p^n and U[n] = {U[p]}p^n- 
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A windowed scattering can be computed by iterating on the one-step 
propagator defined by 

Ujf = {Ajf,{U[X]f),^j,A, 

with Ajf = f -k (f)2J and U[X]f = \ f -kipxl. After calculating Ujf, applying 
again Uj to each U[X]f yields a larger infinite family of functions. The 
decomposition is further iterated by recursively applying Uj to each U[p]f. 
Since U[X]U\p] = U[p + X] and A jU[p] = Sj[p], it results that 

UjU[p]f = {Sj[p]f , {U[p + X]f),eAA . (24) 

Let Ay be the set of paths of length m, with Aj = {0}. It is propagated into 



UjU[Ay]f = {Sj[Ay]f , U[Ay+']f} . (25) 

Since Vj = UmenAy, one can compute Sj[Vj]f from / = ?7[0]/ by iteratively 
computing UjU[Ay]f for m going from to oo, as illustrated in Figured] 




Figure 1: A scattering propagator Uj applied to / computes each U[Xi]f = 
If-kipXj^l and outputs 5'j[0]/ = Applying Uj to each U[Xi]f computes 

all U[Xi,X2]f and outputs S'j[Ai] = U[Xi] •k(f)2J- Applying iteratively Uj to 
each U[p]f outputs Sj[p]f = U[p]f *02-' and computes the next path layer. 

Scattering calculations follow the general architecture of convolution neural- 
networks introduced by LeCun [TT]. Convolution networks cascade convolu- 
tions and a "pooling" non-linearity, which is here the modulus of a complex 
number. Convolution networks typically use kernels that are not predefined 
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functions such as wavelets, but which are learned with backpropagation algo- 
rithms. Convolution network architectures have been successfully applied to 
number of recognition tasks [T^ and are studied as models for visual percep- 
tion 121 [T7]. Relations between scattering operators and path formulations 
of quantum field physics are also studied in [9] . 

The propagator Ujf = {Ajf, (|W^[A]/|)aga,j} is nonexpansive because the 
wavelet transform Wj is unitary and a modulus is nonexpansive in the sense 
that ||a| — l&ll < |a — b\ for any (a, b) G C^. This is valid whether / is real or 
complex. As a consequence 

WUjf-Ujhf = \\Ajf~Ajhf+J2\\mX\f\-\W[\]h\f 

AeAj 

< \\Wjf-Wjhf<\\f-hf. (26) 

Since Wj is unitary, setting h = also proves that ||f/j/|| = ||/||, so Uj 
preserves the norm. 

For any path set Q the norms of Sj[Q]f and U[Q]f are 

\\sjmf = J2\\s-^m" iif/[^^]/ir = $^iif/[p]/ir- 

pen peQ, 

Since Sj[Pj] iterates on Uj which is nonexpansive, the following proposition 
derives that Sj[Vj] is also nonexpansive. 

Proposition 2.5 The scattering transform is nonexpansive: 

V(/, h) G L\R'f , \\Sj[Vj]f - Sj[Vj]h\\ < 11/ - /ill . (27) 



Proof: Since Uj is nonexpansive, it results from fl25|) that 

||f/[Ay]/-f/[Ay]/if > \\UjU[Ay]f-UjU[Ay]hr 

= \\Sj[A^]f - Sj[A^]hr + \\U[A^^']f - f/[Ar>f 

Summing these equations for m going from to oo proves that 

oo 

\\Sj[Vj]f - Sj[Vj]hr = \\Sj[Ay]f - Sj[Ay]hr < 11/ - /if . □ (28) 

m=0 
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Section fI72\ explains that each U[\]f = \f -^^ipxl captures the frequency 
energy of / over a frequency band covered by ipx and propagates this en- 
ergy towards lower frequencies. The following theorem proves this result by 
showing that the whole scattering energy ultimately reaches the minimum 
frequency 2~'^ and is trapped by the low-pass filter 02-/ • The propagated scat- 
tering energy thus goes to zero as the path length increases, and the theorem 
derives that = ||/||. This result also applies to complex- valued 

functions by incorporating negative paths (— Ai, A2, Am) in Vj. 

Theorem 2.6 A scattering wavelet ip is said to be admissible if there exists 
?7 G M'^ and p>0, with |p(a;)| < |</)(2a;)| and p(0) = 1, such that the function 

+00 

^(c) = \p{u - r^)r - 5^ A: (1 - \p{2-\uj - (29) 

k=l 

satisfies 



a= inf ^ ^^(2~-'r-^a;) |^/'(2--'r-^tj)P > . (30) 

~' '~ j=— 00 reG 

If the wavelet is admissible then 

^feL\R') , hm \\U[AJ]fr= hm V ||5j[A}]/f = (31) 

m, — ^no m. — ^no f * 



n=m 



and 



\Sj[rj]f\\ = 11/11 . (32) 



Proof: We first prove that limm^oo ||f^[^!F]/ll = is equivalent to having 
lim„^«,Er=n^ll^j[Aj]/f = Oandto \\Sj[Vj]f\\ = \\f\\. Since \\Ujh\\ = \\h\\ 
for any h G L2(M'^) and UjU[A'}]f = {5j[A}]/ , f/[Aj+^]}, 

\\u[A-j]fr = \\ujU[A-j]fr = wsAAm' + wiATvr- m 

Summing for m < n < 00 proves that limm^oo ||^[^!F]/II = is equivalent 
to lim„^ooE^=n^ll'5J[A}]/f = 0. Since / = f/[AO]/, summing m for 
< n < m also proves that 

m—l 
n=0 
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so \\Sj[Vj]fr = Z7=o WSjmff = ll/f if and only if lim^^^ \\U[AJ]\\ = 
0. 

We now prove that condition f l29|) implies that linim^oo = 0. It 

relies on the following lemma, which gives a lower bound of If-ktpxl convolved 
with a positive function. 

Lemma 2.7 If h > then for any f E L^(]R'^) 

\f ^ijjx\^ h > sup \f -k'lpx^k hr^\ with hr,{x) = h{x) e''^'' . (35) 



The lemma is proved by computing 



\f -kipx\*h{x) 



> 



f{v)i)x{u - v) dv 



h{x — u) du 



f{v)i)x{u - v) e^''(^-") h{x - u) dv 



du 




f{v)i)x{u - v) h{x - u) e^''(^"") dv du 



fiv) hiPx{x-v- u') h{u') e*""' du'dv 



f{v)ipx * hr^{x — v)dv 



\f -ki)x'*^hr^\ 



which finishes the lemma's proof. 

Appendix |X] uses this lemma to show that the scattering energy propa- 
gates progressively towards lower frequencies, and proves the following lemma. 

Lemma 2.8 // ^3^) is satisfied and 



then 



a 



j=0 rGG+ 



||t/[P,]/f <max(J+l,l)||/|r + 



(36) 



(37) 
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The class of function for which < oo is a logarithmic Sobolev 

class, corresponding to functions having an average modulus of continuity 
in L2(M^). Since 

+00 

\\u[vj]fr = Y.\\u[K^]f\\\ 

m=0 

if 11/ 11^ < 00 then fl371) implies that limm^oo llf^l^j"]/!! ~ 0- '^^^^ result is ex- 
tended in by density. Since e L^(M'^) and 0(0) = 1, any / e 
satisfies limji_^_oo ||/ — /n|| = 0, where /„ = f-kct)^^ and 02" (2^) = 2~"''^0(2~"'a;). 
We prove that Ximm^oo \\U[^y^]fn\\'^ = by showing that ||/n||«; < 00. Indeed 

\\Wpr]Ur = j \f{u)?m^uj)?\i^{2-=r-'uj)?duj 

because ■0 has a vanishing moment so | "0(^)1 = 0(|a;|), and the derivatives 
of are in L-^(M'^) so |c<;| \(t){u})\ is bounded. It results that ||/n|U < 00. 
Since f/[A™] is nonexpansive ||f/[Aj] / - U[AJ]fn\\ < ||/ - /nil so 

||f/[A7]/ll < ||/-/„|| + ||f/[Ay]/„||. 

Since lim„_^_oo 11/ — /n|| = and limm^oo ||f^[^!F]/nll = it results that 
lim^^^ \\U[Kj]ff = for any / e 'L^W'). □ 

The proof shows that the scattering energy propagates progressively to- 
wards lower frequencies. The energy of U[p\f is mostly concentrated along 
frequency-decreasing paths p = {Xk)k<m for which |Afc+i| < |Afc|. For exam- 
ple, if / = 5 then paths of length 1 have an energy ||t/[2-'r]5||^ = ||'02jv|P = 
2~'^"'||^||^- This energy is then propagated among all paths p E Vj. For a 
cubic spline wavelet in dimension d = 1, over 99.5% of this energy is con- 
centrated along frequency-decreasing paths. Numerical implementations of 
scattering transforms thus limits computations to these frequency decreas- 
ing paths. The scattering transform of a signal of size N is computed along 
all frequency- decreasing paths, with O(A^logA^) operations, by using a filter 
bank implementation [13j. 

The decay of Yl'^=m ll'5'j[Aj]/||^ implies that we can neglect all paths of 
length larger than some m > 0. The numerical decay of H^jfAjJ/p appears 
to be exponential in image and audio processing applications. The path 
length is limited to m = 3 in classification applications [H |3] . 
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Theorem 12.61 requires a unitary wavelet transform and hence an admissible 
wavelet which satisfies Littlewood-Paley condition /3X](jr)GZxG |'^(2-''^i^)P = 
1. There must also exist p > and r] E with \p{uj)\ < |0(2u;)| such 
that J2{jr)£ZxG li'i'^'' roj)\'^\p{2^ ru — 77) p is sufficiently large so that a > 0. 

This can be obtained if according to (jS]), ^{x) = e^^^9{x) and hence ip{uj) = 
6{u} — rj), where 6 and p have their energy concentrated over nearly the 
same low frequency domains. For example, an analytic cubic spline Battle- 
Lemarie wavelet is admissible in one dimension with rj = 37r/2. This is 
verified by choosing p to be a positive cubic box spline, in which case a 
numerical evaluation of fl30|) gives a = 0.2766 > 0. 



2.4 Translation Invar iance 

We show that the scattering distance H'S'jfPj]/ — is non- increasing 

when J increases, and thus converges when J goes to 00. It defines a limit 
distance which is proved to be translation invariant. Section [3] studies the 
convergence of Sj[Pj]f when J goes to 00, to the translation invariant scat- 
tering transform Sf. 

Proposition 2.9 For all (/, h) G L2(M'=')2 andJ e Z 

\\Sj+i[Vj+i]f - Sj+i[Vj+i]h\\ < \\Sj[Vj]f - Sj[Vj]h\\ . (38) 

Proof: Any p' G Vj+i can uniquely be written as an extension of a path 
p eVj where p is the longest prefix of p' which belongs to Vj , and p' = p + q 
for some q G Vj+i. The set of all extensions of p eVj in Pj+i is 

P^+i = {p} U {p + 2'r + f}reG+,p"eVj+, . (39) 

It defines a non-intersecting partition of "Pj+i = Upg-p_,Pj_^]^. We shall prove 
that such extensions are nonexpansive: 

J2 \\Sj+i\p']f - Sj+iIpW < \\Sj[p]f - SMhf- (40) 

v'gV 

To later prove Proposition 13. 3[ we also verify that it preserves energy 

E \\sjMr = \\sj[p]fr. (41) 
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Summing (HQ]) on all p gVj proves (155]) . 

Appendix El proves in flT^ that for all g E L^(R'^) 

\\g -k (j)2J+if + ^ \\9'*^^2Jrf = ||5'*02jf • 
rgG+ 

Applying it to g = U[p]f — U[p]h together with U[p]f -k = Sj[p]f and 
Ulplf-k ^2Jt = U[p + 2-^r]f gives 

\\Sj[p]f-Sj[p]hf = \\Sj+,[p]f-Sj+i[p]hr (42) 

+ J2 \\U[p + 2-'r]f -U[p + 2'r]h\\^ 

Since Sj+i[Vj+i]U[p + 2'r]f = {Sj+i[p + 2-'r + p"]}p.^rj^^, and Sj+i[Vj+i]f 
is nonexpansive, it implies 

\\Sj[p]f - Sj[p]hf > WSj+Mf - Sj+MH' 

which proves fHUj) . Since Sj[Vj+i]f preserves the norm, setting h = in f H2|) 
gives 

||5,[p]/f =||5,+i[j9]/f + Yl Ell^^+i[P + 2'^ + P"]/ll'' 

p"GPj+i G+ 

which proves f l4ip . □ 

This proposition proves that — S'j[Pj]/;,|| is positive and non- 

increasing when J increases, and thus converges. Since Sj[Pj] is nonexpan- 
sive, the limit metric is also nonexpansive 

V(/, h) e V{R')' , hm \\Sj[Pj]f - Sj[Pj]h\\ < 11/ - /ill . 

J— >-co 

For admissible scattering wavelets which satisfy f l30|) , Theorem 12.61 proves 
that \\Sj[Vj]f\\ = 11/11 so limj^oo||^j[Pj]/|| = ||/||. The following theorem 
proves that the limit metric is translation invariant. 

Theorem 2.10 For admissible scattering wavelets 

V/ G V{R') Vc G M^ hm \\Sj[Vj]f - Sj[rj]LJ\\ = . 

J— >oo 
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Proof: Since Sj[Pj] = Sj[Vj] and Sj[Vj]f = Aj U[Vj]f 

\\Sj[Vj]LJ - Sj[Vj]f\\ = \\L,AjU[Vj]f - AjU[Vj]f\\ 

< \\LAj-Aj\\\\U[Vj]f\\. (43) 

Lemma 2.11 There exists C such that for all t G C^(M'^) with ||Vr||oo < 
1/2 we have 

\\L,Ajf-Ajf\\<C\\f\\2-' ||r|U • (44) 

This lemma is proved in Appendix [Bl Applying it to r = c and hence ||t||oo = 
|c| proves that 

\\L^Aj - A,j\\ < C 2^-^ \c\ . (45) 
Inserting this in fl43p gives 

\\LcSj[Vj]f - Sj[Vj]f\\ < C 2-' \c\ \\U{Vj]f\\ . (46) 

Since the admissibility condition fl5U]) is satisfied, Lemma 12.81 proves in 
dSZ]) that for J > 1 

2l|f^[^^]/f<('^+l)ll/f+ 11/11^- (47) 

If \\f\\w < oo then it results from (H^ that 

\\L,Sj[Vj]f - Sj[Vj]fr <{{J +1) \\fr + \\f\\l)C'2a-'2-'-^ jc^ 

so hmj^oo \\LcSj[Vj]f - Sj[Vj]f\\ = 0. 

We then prove that limj^oo \\L^Sj[Vj]f-Sj[Vj]f\\ = for all / G V{R'^), 
with a similar density argument as in the proof of Theorem I2.6[ Any / e 
L^(M'^) can be written as a limit of {/n}neN with ||/n|U < oo, and since 
Sj[Pj] is nonexpansive and unitary, one can verify that 

\\LcSj[Vj]f - Sj[Vj]f\\ < \\L,Sj[Vj]fn - Sj[Vj]fJ + 2 11/ - . 

Letting go to oo proves that limj^oo \\LcSj[Pj]f — Sj[Vj]f\\ = 0, which 
finishes the proof. □ 
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2.5 Lipschitz Continuity to Actions of DifFeomorphisms 

This section proves that a windowed scattering is Lipschitz continuous to 
the action of diffeomorphisms. A diffeomorphism of sufficiently close to a 
translation maps x to x — t{x) where r(x) is a displacement field such that 
||Vr||oo < 1- The diffeomorphism action on / G L^(]R'^) is = f{x — 

t{x)). The maximum increment of r is denoted ||Ar||oo = sup(^. „-)g]R2d \t{x) — 
t{u)\. Let Sj be a windowed scattering operator computed with an admis- 
sible scattering wavelet which satisfies (!30l) . The following theorem com- 
putes an upper bound of \\Sj[Vj]Lrf — /|| as a function of a mixed 
(1^L2(R'^)) scattering norm: 

+00 

l|f/[^i]/lli = EllW]/ll • (48) 

m=0 

We denote Vj^m the subset of Vj of paths of length strictly smaller than m, 
and (a V 6) = max(a, b). 

Theorem 2.12 There exists C such that all f G L^{R'^) with \\U[Vj]f\\i < 
00 and all r G C^(R'^) with ||Vr||oo < 1/2 satisfy 

\\Sj[Vj]LJ - Sj[Vj]f\\ < C \\U[Vj]fhK{T) (49) 

with 

K{t) = 2-^||r|U + ||Vr||oo(log^^ V 1) + ||ifr||oo , (50) 

II Vr||oo 

and for all m > 

\\Sj[Vj,^]LJ - Sj[Vj,M\ < CmWfW Kir) . (51) 
Proof: Let [Sj[Vj], U] = Sj[Vj] - S.j[VjI 
\\Sj[Vj]L,f^Sj\Vj]f\\ < \\USj[Vj]f -Sj[Vj]f\\ + \\[Sj[Vj], urn ■ (52) 
Similarly to ( H3l) the first term on the right satisfies 

\\LrSj[Vj]f - Sj[Vj]f\\ < WL^Aj - Aj\\ \\U[Pj]f\\ . (53) 

Since 

(+00 \ +00 

m=0 / m=0 
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it results that 

\\LrSj[Vj]f - Sj[Vj]f\\ < WL^Aj - Aj\\ \\U[Vj]fh . (54) 

Since Sj[Pj] iterates on Uj which is nonexpansive, Appendix |D] proves the 
following upper bound on scattering commutators. 

Lemma 2.13 For any operator L on L^(M'^) 

\\[Sj[Vj],L]f\\<\\U[Vj]fh\\[Uj,L]\\ . (55) 

The operator L = L.^ also satisfies 

\\[Uj,Lr]\\<\\[Wj,L^]\\ . (56) 

Indeed, Uj = MWj, where M{hj, {hx)xeAj} = {hj, {\hx\)xeAj} is a nonex- 
pansive modulus operator. Since MLr = LrM 

\\[Uj,U]\\ = \\Mj[Wj,Lr]\\ < \\[Wj,U]\\ . (57) 

Inserting (155|) with (!56|) and in (!52l) gives 

\\Sj[Pj]L^f - Sj[Vj]f\\ < \\U[Pj]fh(^\\L^Aj-Aj\\ + \\[Wj,L^]\\) . (58) 

Lemma [2.111 proves that ULt-Aj — Aj\\ < C2^'^ ll'^^lloo- This inequality and 
(EH]) imply that 

<C||f/[Pj]/||i(2-^||r|U + ||[W^j,i:.]||) . (59) 

To prove fH9]) , the main difficulty is to compute an upper bound of || [Wj, L^] || , 
and hence of || [IVj, L^]||^ = ||[H/'j,L^]* [iyj,L^]||, where A* is the adjoint of 
an operator A. The wavelet commutator applied to / is 

[Wj,L^]f = {[Aj,U]f , {[W[X\,U]f)x^AA, 

whose norm is 

\\[Wj,LM' = \\[Aj,LM'+ E ll[W^[A],^.]/f. (60) 

agAj 

It results that 

[Wj, LrY [Wj, Lr] = [Aj, LrY [Aj, L^] + J2 MX],LrY [W[X],Lr]. 

AeAj 

The operator [Wj, L^]* [Wj, L^] has a singular kernel along the diagonal but 
Appendix [E] proves that its norm is bounded. 
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Lemma 2.14 There exists C > such that all J e Z and all t G C^(R'^) 

with ||Vr||oo < 1/2 satisfy 

\\[Wj,U]\\<c(\\Vr\\^i\ogl^^yi) + \\Hr\\Jj . (61) 
^ l|Vr||oo / 

Inserting the wavelet commutator bound flMl) in fl5I?]) proves the theorem 
inequahty (H9|) . One can verify that (149|) remains vahd when replacing Vj by 
the subset of paths of length smaller than m: Vj^m = U„<mAj, if we replace 
\\U[Vj]f\\i by \\U[Vj,m]f\\i- The inequality m results from 

m— 1 

iif/[^j,n.]/iii = Eiif^[^:^]/ii^"^ 11/11- (62) 

n=0 

This is obtained by observing that 

wmn < \\u[Arv\\ < 11/11 , (63) 

because [/[A"]/ is computed in by applying the norm-preserving oper- 
ator Uj on U[Ay^]f. □ 

The condition ||Vr||oo < 1/2 can be replaced by ||Vr||oo < 1 if C is re- 
placed by C(l - ||Vr|U)-". Indeed \\Sj[Vj]f\\ = \\f\\ and \\Sj[Vj]LJ\\ < 
11/11(1 — II Vr||oo)"'^- This remark applies to all subsequent theorems where 
the condition ||Vr||oo < 1/2 appears. The theorem proves that the dis- 
tance \\Sj[Pj]Lrf — 5'j[Pj]/|| produced by the diffeomorphism action Lr 
is bounded by a translation term proportional to 2~''||r||oo and a deforma- 
tion error proportional to ||Vr||oo- This deformation term results from the 
wavelet transform commutator [H^j,Lt-]. The term log(|| Ar||oo/|| Vr||oo) can 
also be replaced by max(J, 1) in the proof of Theorem 12.121 For compactly 
supported functions /, Corollary 12.151 replaces this term by the log of the 
support radius. 

If / G L^(]R'^) has a weak form of regularity such as an average mod- 
ulus of continuity in L^(]R'^) then Lemma 12.81 proves that ||f^['Pj]/|P = 
||t/[Aj]/|p is finite. Numerical experiments indicate that ||?7[A}]/|| 
has exponential decay for a large class of functions, but we do not charac- 
terize here the class of functions for which Hf/iPjl/Hi = ll^[^j]/ll 
finite. In audio and image processing applications [U |3], the percentage of 
scattering energy becomes negligible over paths of length larger than 3 so 
(I3T]) is applied with m = 4. 
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The following corollary derives from Theorem 12.121 that a windowed scat- 
tering is Lipschitz continuous to the action of diffeomorphisms over compactly 
supported functions. 

Corollary 2.15 For any compact f2 C M*^ there exists C such that for all 
f e L2(R"') supported in n with \\U[Vj]f\\i < oo and for all r G C^iR"^) with 
||Vr|U < 1/2, zf2J > then 

\\Sj[Vj]LJ - Sj[Vj]f\\ < C \\U[Vj]fh (||Vr|U + ll^^^riu) . (64) 

Proof: The inequality (|64l) is proved by applying (H9l) to a f with Lff = 
Lrf, and showing that there exists C which only depends on Q such that 

2"^ll^lloo + ||Vf||oo(log^^Vl) + ||fff||oo < C"f||Vr|U + ||i^r|U) • (65) 

||Vr||oo V / 

Since / has a support in Q, Lff = L^-f is equivalent to f(x) = t{x) for 
all X E fir = '■ X — t{x) G Q} and f~^{Q) = Q^-- If ^ has a radius R 
then the radius of Qr is smaller than 2R, because ||Vr||oo < 1/2. We define 
f as a regular extension of r equal to r(x) for x & Qr and to the constant 
mmx^n^ outside a compact Qr of radius (4_R + 2) including Qr- It results 
that 

||Af||oo= sup^ |f(a;) -f(u)| < (4i? + 2) ||Vr||oo . (66) 

The extension in — 0,^ can be made regular in the sense that ||Vf||oo + 
||-f^'r||oo < a (ll^''"lloo|| + ||-f^'7"||oo) for some a > which depends on fl. This 
property together with (l66ll proves (1651) . □ 

Similarly to Theorem I2.12[ if Vj is replaced by the subset Vj^m of paths 
of length smaller than m, then is replaced by in flMl) . If 

Lrfix) = f{{l — s)x) with |Vr(a;)| = |s| < 1 then the upper bound flMl) 
is proportional to m |s| ||/||. In this lower bound is simply obtained 

by observing that since \\Sj[Vj]f\\ = \\f\\ and \\Sj[Pj]Lrf\\ = = 
(1-^)-^ 11/11 

\\Sj[Vj]Lrf - Sj[Vj]f\\ > \\\LJ\\ - ll/lll > 2-1 s ll/ll. 

Together with the upper bound flMl) . it proves that if t{x) = sx then the 
scattering distance of / and L^-f is of the order of ||Vr||oo ||/||- 
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The next theorem reduces the translation error term 2 ||r||oo in Theorem 
12.121 to a second-order term 2~^'^||r||^, with first-order Taylor expansion 
of each Sj[p]f. We denote VSj[Pj]f{x) = {V Sj[p]f{x)}p^rj and r(x) • 
VSj[Vj]f{x) = {t{x) ■ VSj[p]f{x)},^Vj. 

Theorem 2.16 There exists C such that all f e L'^iR'^) with \\U[Pj]f\\i < 
oo and all r G C^(M"') with ||Vr||oo < 1/2 satisfy 

\\Sj[Vj]LJ - Sj[rj]f + T ■ VSj[Pj]f\\ < C \\U[Pj]fh K{t) (67) 

with 

Kir) = 2-''\\t\\1 + ||VrlU(log^% V 1) + \\Ht\\^ . (68) 

II Vr||oo 

Proof: The proof proceeds as the proof of Theorem I2.12[ Replacing 
Sj[Vj]Lr - Sj[Vj] by Sj[Vj]Lr - Sj[Vj]+t. \/Sj[Vj] in the derivation steps 
of the proof of Theorem 12.121 amounts to replace L^-Aj — Aj by L^-Aj — Aj + 
VAj. Equation (1551) then becomes 

\\Sj[Vj]LJ-Sj[Vj]f + T.\/Sj[Vj]\\<\\U[Vj]fh ( \\L^Aj-Aj + VAj 

+ mj,L^]\\) . 

Appendix O proves that there exists C > such that 

WLrAjf -Aj + VAjW < C2-^' \\r\\l . (69) 

Inserting the upper bound ( 1611) of || [W^j, L,-] || proves ( 1671) . □ 

If 2"^ ^ ||t||oo and ||Vr||oo + ||-f^T||oo ^ 1 then K{t) becomes negligible 
and r(x) can be estimated at each x by solving the system of linear equations 
resulting from (167|) : 

ypeVj , Sj[p]Uf{x)-Sj[p]fix) + Tix).VSj[p]f{x)^0. (70) 

In dimension d, the displacement r(x) has d coordinates which can be com- 
puted if the system (170|) has rank d. Estimating t{x) has many applications. 
In image processing, the displacement field t{x) between two consecutive im- 
ages of a video sequence is proportional to the optical flow velocity of image 
points. 
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3 Normalized Scattering Transform 



To define the convergence of Sj\Pj\, all countable sets Vj are embedded in 
a non-countable set Voo- Section [3A] constructs a measure /z and a metric in 
Voo- Section [32] redefines the scattering transform Sf as limit of windowed 
scattering transforms over Voo, with Sf G L'^{Voo,dfi) for / G L^(]R'^). Nu- 
merical comparisons between Sf and |/| are given in Section I3l3l 

3.1 Dirac Scattering Measure and Metric 

A path p E v.] can be extended into an infinite set of paths in Vj+i which 
refine p. In that sense, Pj+i is a set of higher resolution paths. When J 
increases to oo, these progressive extensions converge to paths of infinite 
length, which belong to an uncountable path set Voo- A measure and a 
metric are defined on Poo- 

A path p = (Ai, Am) of length m belongs to the finite product set A^ 
with Aoo = 2^ X G~^. An infinite path p is an infinite ordered string which 
belongs to the infinite product set A^. For complex- valued functions, adding 
negative paths (— Ai, A2, A^) doubles the size of A^ and A^. We concen- 
trate on positive paths (Ai, A2, A^) and the same construction applies to 
negative paths. Since Aoo = 2^ x is a discrete group, its natural topology 
is the discrete topology where basic open sets are individual elements. Open 
elements of the product topology of A^ are cylinders defined for any A G Aoo 
and n > by C„(A) = {q = {qk}k>o e A^ : g„+i = A} [22J. Cylinder sets 
are intersections of a finite number of open cylinders: 

Cn(Al, Xm) = {*? G A^ : Qn+l = Ai, qn+m = Am} = ;^C„+j(Aj) . 

As elements of the topology, cylinder sets are open sets but are also closed. 
Indeed the complement of a cylinder set is a union of cylinders and is thus 
closed. As a result, the topology is a sigma algebra, on which a measure n 
can be defined. The measure of a cylinder set C is written ^{C). 

Let Voo be the set of all finite paths including the path: Voo = U^gnA™. 
To any p = (Ai, Am) G Voo, we associate a cylinder set: 

C{p) = Co{p) = {g G A~ : gi = Ai, gm = Am} • 

This family of cylinder sets generates the same sigma algebra as open cylin- 
ders since open cylinders can be written C„(A) = U(Ai,...,A„)GAg„C(Ai, A„, A). 



26 



The following proposition defines a measure on from the scattering of a 
Dirac: 

U[p]5= I IIV'AihV'Aah.-h^A^I • 

Proposition 3.1 There exists a unique a -finite Borel measure fi, called 
Dirac scattering measure, such that /i(C(p)) = ||t/[j9]5f for all p e Poo- 
For all 2^g G Aoo and p G Poo, f^{C{2^gp)) = 2'^^^i{C{p)) 
If \ip{oj) \ + !■?/)(— a;)! 7^ almost everywhere then \\U[p\5\\ ^ forp G Poo- 

Proof: The Dirac scattering measure is defined as a subdivision measure 
over the tree that generates all paths. Each finite path p corresponds to 
a node of the subdivision tree. Its sons are the {p + AIa^a^, and C{j)) = 
UagAcx)C*(p + A) is a non- intersecting partition. Since 

\\U[p]6f = \\UjU[p]6r = J2 IMp + M^', 

agAoo 

it results that fi{C{p)) = XIagAoo ''^(^(^ '^))" '^^^ sigma additivity of the 
Dirac measure over all cylinder sets results from the tree structure, and 
the decomposition of the measure of a node as a sum of the mea- 

sures /i(C(p + A)) of all its sons. This subdivision measure is uniquely 
extended to the Borel sigma algebra through the sigma additivity. Since 
A~ = UxeA^C{X) and /i(C(A)) = ||t/[A]5||2 = HV^^P, this measure is a-finite. 

We showed in that U[p]{2^gof) = 2^goU[2-^gp]f. Since 2^goS = 2-'^^ 5 
it resuhs \\U[2-^gp\6f = 2~'^^\\U[p]6f and hence /i(C(2V)) = S'^VlCW)- 

If the set of a; G M'^ where ip{uj) = and iIj{—u) = is of measure 0, let us 
prove by induction on the path length that U[p]f ^ if f e L'^{R'^)UL^{R'^) 
or if / = S. We suppose that U[p]f ^ and verify that f/[p + A]/ ^ for any 

A gAoo Since ?7[p]/ is real, |f7R/MI = But V^^M = V^IA'M, 

so i^xiyj) and ^\{—uj) vanish simultaneously on a set of measure 0. It results 

that W^\\f = UWi'x ^ if U\p\f ^ so U[p + A]/ is a non-zero 
function. □ 

A topology and a metric can now be constructed on the path set A^. 
Neighborhoods are defined with cylinder sets of frequency resolution 2'^: 

Cj{p) = U A6A^ C{p + A) c C{p) . (71) 

\\\<2-J 

Clearly Cj+i(p) C Cj{p). The following proposition proves that fi{Cj{p)) 
decreases at least like 2~'^'^ when 2*^ increases, and it defines a distance from 
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these measures. The set of infinite paths is not complete with this metric. 
It is completed by embedding the set Voo of finite paths, and we denote 
Poo = Voo U the completed set. This embedding is defined by adding 
each finite path p G "Poo to C{p) and to each Cj{p) for all J G Z, without 
modifying their measure. We still denote Cj{p) the resulting subsets of Poo- 
For complex valued functions, the size of Poo is doubled by adding finite and 
infinite negative paths (— Ai, A2, A^, •••)• 



Proposition 3.2 If p E Poo is a path of length m then 

f^iCjip)) = \\sj6[p]r<2-'^'\m'\mr. (72) 

Suppose that \ + |'?/'(— 7^ almost everywhere. For any q q' E Voo 

d{q,q') = inf fi{Cj{p)) and d{q,q)=0 (73) 

(<?,<7')GC,j{p)2 

defines a distance on Voo, and Voo is complete for this metric. 
Proof: According to fTTTl) 

p^{Cj{p))= J2 MCb + A))= Yl \\u\p + m\'- 

AeAcx) AeAoo 
A|<2--^ |A|<2--' 

Since U[p+X]6 = Ulpjd-kipx and |02j('^)P = J2 ^^Aoo |-?/'a(w)P, the Plancherel 

|A|<2-J 

formula implies 

MCj(p))= Yl \\uW^M' = \\uW^M\' = \\Sj[p]5r . 

AeAoo 
A|<2-J 

Since [p] 5 = U[p]6-k(l)2J , Young's inequality implies ||5'j[p]5|| < ||02-'| 
Moreover ||[/[A]/||i < ||'0a||i||/||i with Wi'xWi = W'ipWi, so we verify by induc- 
tion that ||f/[p]5||i < ll^ir. inserting ||02^f = 2-'^-^||0f proves ([72]). 

Let us now prove that d defines a distance. If g 7^ q', we denote p G 
Poo their common prefix of longest size m, which may be 0, and show that 
d{q, q') 7^ 0. Let |q'm+i| = 2-'"'+^ and Ig^+il = 2^^+'^ be the frequencies of their 
first different coordinate. If 2"-^ = max(|gm+i|, \q'm+i 

I) then (g,g') G Cj{pf 

and it is the smallest set including both paths so d{q,q') = fi{Cj{p)). It 
results that d{q,q') 7^ because fi{Cj{p)) > fi{C{p + 2"^r)) for r G and 
Proposition 13. II proves that fi{C{p)) 7^ for all p G Poo, so d{q, q') 7^ 0. 
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The triangle inequality is proved by showing that 

V(g, g', q") e V'l , d{q', q") < max(rf(g, q') , d{q, g")) . (74) 

This is verified by writing d{q,q') = fi{Cj{p)), d{q',q") = ft{Cj>{p')) and 
d{q',q") = l^{Cjii{p")). Necessarily p is a substring of p' or vice versa, and 
p" is larger then the smallest of the two. If p" is strictly larger then the 
smallest say p, then li{Cjii{p")) < fi{C{p")) < Cj{p), so fl71|) is satisfied. If 
p" = p = p' then 2~'^ < max(2~"^, 2~'^ ) and (174|) is satisfied. Otherwise 
p" = p is strictly smaller than p' and necessarily 2-^ = 2-^ so (174ll is also 
satisfied. 

To prove that Voo is complete, consider a Cauchy sequence {gjjjeN in Voo- 
Let pk be the common prefix of maximum length among all qj for j > k. 
It is a growing string which either converges to a finite string p G Voo if '^fc 
is bounded or to an infinite string p G A^. Among all paths {qj}j>k whose 
maximum common prefix with p has a length rrik, let gj^. be a path whose 
next element Xmk+i has a maximum frequency amplitude |Amj,+i|. One can 
verify that 

sup d{qj,qj') = d{qj^,p) = sup d{p,qj) . 

j,j'>k j>k 

Since supj ji^f, d{qj, qj/) converges to as /c increases, it implies that supj>;j d{p, qj) 
also converges to and hence that {gjjjGN converges to p. □ 



3.2 Scattering Convergence 

For h G L'^(Voo,dfi), we denote \\h\\^ = f^p \h{q)\'^ dfi{q) , where /i is the 
Dirac scattering measure. This section redefines the scattering transform 
Sf as a limit of windowed scattering transforms, and proves that Sf G 
L^('Poo, dfi) for all / G L^(M'^). We suppose that ip is an admissible scattering 
wavelet, and that |'?/'(c(;)| + |^/'(— u;)| ^ almost everywhere. 

Let lcj(p)(g) be the indicator function of Cj{p) in Voo- A windowed 
wavelet scattering Sj[Vj]f{x) = {Sj[p]f{x)}p^-pj is first extended into a 
normalized function of (g, x) G Voo x 



29 



It satisfies Sjf{p,x) = Sj[p]f{x)/\\Sj[p]S\\ for p G Poo- Since n{Cj{p)) 
for all G L2(M^)2 

/ / |5j/(g,x)-5j/i(g,a;)pdMg)c?x= ||5j[Pj]/-5j[Pj]/if < 
_ / |5'j/(g,a;)|^ii/i(g)(ix = ||5'j[Pj 



SO Sjf{q,x) can be interpreted as a scattering energy density in Voo x IR'^- 

The windowed scattering Sjf{q,x) has a spatial resolution 2""' along 
X and a resolution 2-^ along the frequency path q. When J goes to oo, 
Sjf{q,x) loses its spatial localization, and Theorem 12.101 proves that the 
asymptotic metric on Sj[Vj]f and hence on Sjf{q, x) is translation invariant. 
The convergence of Sjf{q,x) to a function which depends only on g G Voo 
is studied by introducing the marginal L2(M'^) norm of Sjf{q,x) along x for 
q fixed: 

VgGPoo , Sjfiq) = I |^^/(g,x)|^dx= |^^Mffllc,(rt(g). (76) 

It is a piecewise constant function of the path variable q, whose resolution 
increases with J. Since fi{Cj{p)) = ||5'j[p]5||2, 

\\Sjf~Sjh\\'^^ = [_ \Sjf{q)-Sjh{q)\'dfi{q) = \\\Sj[p]f\\-\\Sj[p]h\ 

(77) 

The following proposition proves that Sj is a nonexpansive operator which 
preserves the norm. 

Proposition 3.3 For all (/, h) G L'^{R'^f andJ e Z 

\\Sjf - Sjhy^ < - Sj+,h\\v^ , (78) 

\\Sjf - Sjhy^ < wsjIVjU - Sj[Vj]h\\ <\\f-h\\, (79) 

WSjfh^, = 11/11 . (80) 
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Proof: We proved in fl41l) that 



\sjwr 



(81) 



where Pj+i = Upe-p^Pj_,_]^ is a disjoint partition. Applying this to / and h 
imphes 

\\sj[pm-\\sj[p]h\\\' < J2 \\\sj+i[p']f\\-\\sj+,[p']h\\\\ 

Summing over p eVj and inserting (!77|) proves ( ITHj) . 

Since 1 1 [p] / 1 1 — 1 1 [p] /i 1 1 < 1 1 •S'j [p] / — [p] /i 1 1 , summing this inequahty 

over p eVj and inserting (177|) proves the first inequahty of (!79|) . The second 
inequality is obtained because Sj\Pj] is nonexpansive. Setting /i = proves 
that PjIIp^ = \\Sj[Vj]f\\ and Theorem ESI proves = ||/||, which 

gives dHOD-P _ 

Since \\Sjf — S'j/i||p^ is non-decreasing and bounded when J increases, 
it converges to a limit which is smaller than the limit of the non- increasing 
sequence — Sj\Pj\h\\. The following proposition proves that Sjf 

converges pointwise to the scattering transform on Voo introduced in Defini- 
tion |231 



Proposition 3.4 If f eL^ 

\/p G V, 



then 



oo , lim Sjf{p) = Sf{p) = — 



U[p]f{x) dx 



(82) 



with fJ'p = J U[p]6{x) dx. 



Proof: If p G Voo then for J sufficiently large Sjf{p) = \\Sj[p]f\\/\\Sj[p]S\\. 
Let us prove that 



lim 2'''/'\\Sj[p]f\\ 



J— >oo 



U[p]f{x) dx 



(83) 



and that this equality also holds for f = S. Since Sj\p]f = U\p]f -k 02-^, the 
Plancherel formula implies 



\Sj[p]f\\' 



2tt) 



\U[p]f{co)\'m'co)\'dco 
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Since derivatives of are in L^(R'^), we have 4>{uj) = 0((1 + \uj\) ^) and 
hence (27r) "'^2'^"^ 10(2-^0;) p converges to Moreover, if / G Li(R'^) 

then U[p]f G L^(R'^) so U[p]f{u) is continuous at w = 0. It results from (IMI) 

that hmj_,oo2'^^||Sj[p]/f = \U\p\f i*^)? UW'^ which proves ([HS]). The same 
derivations hold to prove this result for f = 6. 

Since |-?/'(a;)| + |'?/'(— a;)| 7^ almost everywhere, Proposition l3. ll proves that 
U\p]6 7^ 0. Since it is positive, it has a non-zero integral. It results from (1831) 
that limj^oo ||'S'j[p]/||/||5'j[p]5|| = J U[p]f{x)dx/ J U[p]6{x)dx which proves 

(IH2D. □ _ _ 

The scattering transform Sf can now be extended to Voo as a windowed 
scattering limit: 



VgGPoo , ^/(g)=liminf^j/(g) . 

J— >oo 



Proposition 13.31 proves that = ||/|| so Fatou's lemma implies that 

5*/ G L'^{Voo, dfi). The following theorem gives a sufficient condition so that 
Sjf converges strongly to Sf, which then preserves the L^(M'^) norm of /. 



Theorem 3.5 IfforfeL 



there exists Q-^j C Vj with 



lim \\Sj[n^j] 

J— >oo 



and lim sup 



Sj[p]f Sj[p]S 



ISjWW WSjWW 



(85) 



then Sjf converges in norm to Sf with \\Sf\\ 



and 



\Sjf{q)\'dM = \\U[p]f\[' 



C{P) 



(86) 



If{f,h) G L\Wy satisfy ^ then 



lim \\Sj[Pj]f - Sj[Vj]h\\ = 115/ - shy^ 



(87) 



If (E3) is satisfied in a dense subset o/L^(]R'^) then Sjf converges strongly 
to Sf for all f G L2(R'^) and both (ED and ^ are satisfied m L2(R'^). 



Proof: The following lemma proves that {Sjf}j(zfq is Cauchy and hence 
converges in norm to Sf G L^(Poo5 dfi). The proof is in Appendix [Fl 



32 



Lemma 3.6 If f E L^(]R'^) satisfies l[8^) then {5'j/}jgN is a Cauchy se- 
quence in 'L^iVoo.djj). 

Since L'^(Voo,dfJ') is complete, Sjf{q) converges in norm to its limit 
inf 5/. Since \\Sjf\\ = \\f\\, it also implies that \\Sfy^ = \\f\\. More- 



over, U[p + q] = U[q]U[p] so WSjUmil,^ = Jct^^)\Sjfiq)\'dM. Since 
= taking the limit when J goes to oo proves (186|) . 

/ oo 

The windowed scattering convergence (187|) relies on the following lemma. 



Lemma 3.7 If{f,h) G L'^iR'^f satisfy (E^j then 

hm \\Sj[Vj]f - Sj[Vj]h\\ = hm \\Sjf - Sjh\y^ 



J— >oo 



J— >oo 



(88) 



Since (185!) implies that S'j/ and Sjh respectively converge in norm to Sf 
and Sh, the convergence (1871) results from (l88l) . Proving (IHH!) is equivalent 
to proving that hmj^^o EpePj ^.^(Z' = fo'^ 



/j(/,/^)[p] = ii5j[p]/-5j[p]/iir 

Observe that 

Ij{f,h)[p] = \\Sj[p]f\\\\Sj[p]h 

< 2\\Sj[p]f\\\\Sj[p]h\\ 



\Sj\p]f\\ - WSjIpM 



Sj[p]f Sj[p]h 



Sj[p]f\\ \\Sj[p]h 
Sj[p]f Sj[p]6 



+ 



Sj[p]f\\ \\Sj[p]6\\ 
Sj[p]h Sj[p]S 



(89) 



\Sj[p]h\\ \\Sj[p]6\\ 



When summing over p E Vj, we separate iljU^lj from its complement in Vj. 
Since hmj^oo \\Sj[n^j]f\\' = 0, \\Sj[Pj]f\\' = \\f\\^ limj^^ \\Sj[n'}]hf = 0, 
and \\S,j[Vj]hf = ||/if , dividing the sum over Qj and flj and applying 
Cauchy-Schwartz proves that 



lim Yl \\Sj\p]f\\\\Sj[p]h\\=0 
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and \\Sj\p]f\\ \\Sj[p]h\\ < \\f\\ \\h\\. The hypothesis m apphed to / 

and h gives 

hm sup 



Sj[p]f Sj[p]5 



\Sj[p]f\\ WSjIpM 



Sj[p]h Sj[p]S 



\Sj\p]h\\ WSjWW 



so (l89l) imphes that hmj_^oo J^peVj ^jif' ^)[p] — 0> which finishes the Lemma 
proof. 

Suppose that (185|) is satisfied in a dense subset of L^(M'^). Any / e L^(R'^) 
is the hmit of {fn}n>o in this dense set. Since 5* and Sj are nonexpansive 

\\Sf - Sjfy^ < 2 11/ - + - SjfJ^^ . 

Since /„ satisfies ( 185|1 . we proved that Sjfn converges in norm to Sfn- Letting 
n go to oo imphes that Sjf converges in norm to S. The previous derivations 
then imphes that both (15^ and ( 1S7|) are satisfied in L^(M'^). □ 

If / e LHK'^) and p G Poo, since Sj[p]f{x) = U[p]f^(j),j and \\U[p]f\\i < 
oo, applying the Plancherel formula proves that 



lim 

J— >oo 



Sj[p]f Sj[p]6 



\Sj[p]f\\ \\Sj[p]6\\ 







(90) 



This is however not sufficient to prove ( l85l) because the sup is taken over all 
p E Vj — Qj which grows when J increases. For / G L^(]R'^), one can find 
paths p J G Vj, which are not frequency-decreasing, where >S'j[pj]//||5'j[pj]/|| 
does not converge to S'j[pj]5/||S'j[pj]5||. The main difficulty is to prove that 
over the set Qj of all such paths, a windowed scattering transform has a 
norm ||S'j[f2j]/|| which converges to zero. Numerical experiments indicate 
that this property could be valid for all / G L^(]R'^). It also seems that if 
/ G L^(M'^) then Sf{q) is a continuous function of the path q, relatively 
to the Dirac scattering metric. This is analogous to the Fourier transform 
continuity when / G L^(M'^). 

Conjecture 3.8 Condition ( fg3]) holds for all f G L^(M'^). Moreover, if 
f G L^(M'^) then Sf{q) is continuous in Voo relatively to the Dirac scattering 
metric. 

If this conjecture is valid, since L^(M'^) is dense in L^(M'^), then Theo- 
rem [32] proves that Sj converges strongly to Sf for all / G L^(]R'^), and 
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WSfWr^ =J/||. Property m also proves that \\Sj[Vj]f - Sj[Vj]h\\ con- 
verges to \\Sf — Sh\\^^ as J goes to oo. Through this hmit, the Lipschitz 
continuity of Sj under the action of diffeomorphisms can then be extended 
to the scattering transform S. 

3.3 Numerical Comparisons with Fourier 

Let M'^"'" be the half frequency space of all u = {ui, Ud) G M.'^ with ui > 
and Wfc G K for k > 1. To display numerical examples for real functions, the 
following proposition constructs a function from ]R'^+ to Voo which maps the 
Lebesgue measure of R"'"'' into the Dirac scattering measure. It provides a 
representation of 5*/ over M'^"'". We assume that is an admissible scattering 
wavelet, and that + ^ almost everywhere. 

Proposition 3.9 There exists a surjective function q{uj) from onto Voo 
such that for all measurable sets Q C Voo 



Proof: The proof first constructs the inverse g by mapping each cylin- 
der C{p) for p G Voo into a set q~^{C{p)) C M'^"'" satisfying the follow- 
ing properties: /i(C(p)) = J^_,^^^^^^duj, and q'\C{p)) n q-\C{p')) = 
if C{p) n Cjp') = 0, and q-^{C{p)) C q-\Cip')) if C Let 

q^^{C{p)) be the closure of q~^{C{p)) in M'^"'". For all p 7^ 0, we also im- 
pose that the frontier of q~^{C{p)) is a set of measure in R'^^, and that 
q~^{C{p + A)) C q~^{C{p)) for all A G Aoo- The cylinders C{p) generate the 
sigma algebra on which the measure n is defined. A measurable set Q can be 
approximated by sets Qk which are union of disjoint cylinder sets C{p) with 
limfc^oo/^(^ — ^k) = 0. The properties of q~^ on the cylinders C{p) imply 
that du = fi{^k) and when k goes to 00 we get (1911) . 

Once all q^^{C{p)) are constructed, the inverse q{uj) is uniquely defined 
for all CO G R'^^, as follow. Let Pm be the prefix of g G Voo of length m. We 
define q-^{q) = Hm^q'^C (Pm)) ■ Since q-^{C{p + \)) C q~ \C{p)) for all 
A G Aoo, it results that rimeNq"^ jC (Pm )) = ^meNq~^ iC{pm)). It is a closed 
non-empty set because q^'^{C{pm)) C q^^ {C (pm-i)) is a non-empty set of 
measure ||[/[pm]<5|| 7^ 0. We verify that q{ijj) = q for all u G q~^{q) defines 
a surjective function on R''"'" by showing that U^gp^g~^(g) is a partition of 




doo . 



(91) 
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jf -p^ jg ^jjg Qf pg^^jj Qf length m then Upgp^C(p) is a partition 
of Poo, SO the recursive construction of imphes that Up(='p^q^^{C{p)) is 
a partition of Letting m go to infinity proves that U^gp^g~^(g) is a 

partition of M'^"'". 

The sets ?~^(C(]9)) satisfying the previously mentioned properties are 
defined recursively on the path length, with a subdivision procedure. In 
dimension d = 1, each q~^{C{p)) is recursively defined as an interval of 
]R+. We begin with paths p = 2^ of length 1 by defining q~^{C{2^)) = 
[2-'||7/'||2,2J+i||^P), whose width is 2^||V^p = /u(C(2J)). Suppose now that 
q^^{C{p)) is an interval of width equal to fi{C{p)). AU q-^{C{p + 2^)) for 
J e Z are defined as consecutive intervals [uj, Oj-i), which define a partition 
of q-\C{p)) = Ujezlaj, with aj^i - a, = \\U[p + 2^5f = fi{C{p + 2^)). 
One can verify that this recursive construction defines intervals q~^{C{p)) 
which satisfy all mentioned properties. Moreover, in this case the resulting 
function q^cu) is bijective from to Voo- 

In higher dimensions d > 1, this construction is extended as follow. All 
cylinders C(A) for all paths p = \ = 2^r of length 1 are mapped to non- 
intersecting hyper-rectangles g~^(C(A)) of measure 

/ duj = /^(C(2^r)) = \\U[2^r]6\\^ = 2''^ ||V^f . 

These hyper-rectangles are chosen to define a partition of , and hence 
= UAgAoc5"^(C(A)) with q-\C{X)) n q-\C{\)) = for A 7^ A'. Sup- 
pose now that q~^{C{p)), with Jq-i(^c{p)) ~ \W\p\^\\'^^ defined for all paths 
p of length m. Since U preserves the norm ^_xgAoo II^Ip + '^I^P ~ ll^[p]^IP- 
We can thus partition q^^{C{p)) into subsets {q^^{C{p + A))}AeAoo with 
/g-i(c(p+A) ~ \W[P + whose frontiers are piecewise hyperplanes of 

dimensions d — 1 and hence have a zero measure. 

The property g~^(C(p + A)) C q'^{C{p)) for all A e Aoo is obtained 
with a progressive packing strategy. We first construct q~^{C{p-\- A)) for all 
A = 2^r with j > 0, by defining a partition of a closed subset of q~^{C(p)) 
of measure XIagA |a|>i + The remaining q~^{C{p + X)) are then 

progressively constructed for A = 2^r and j going from —1 to — cxo, within the 
remaining closed subset of q~^{C{pj) not already allocated. This is possible 
since we guarantee that the frontier of each q~^{C{p)) has a zero measure. 
□ 
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The function q{u)) maps the Lebesgue measure into the Dirac scattering 
measure, but it is discontinuous at all u G M'^"'" such that G "Poo- Indeed 
these u are then at a boundary of the subdivision procedure used to construct 
q{u). As a result, if u and u' are on opposite sides of a subdivision boundary- 
then they are mapped to paths q{u) and q{uj') whose distance d[q{uj), q{oj')) 
does not converge to as |a; — uj'\ goes to 0. 

Measure preservation flUT]) implies that q{(jj) defines a scattering function 
'Sf{q{uj)) G L2(M^+) with 



\\Sf{q{i^))\\i.. = / \Sf{q{ujWduj = _ \Sf{q)\'dM = \\Sf\\^^ . 
Jr''+ JVoo 

If / is a complex-valued function, then Voo is a union of positive paths 
q = (Ai, A2, A3...) and negative paths —q = (— Ai, A2, A3...). Setting g(— w) = 
~q{ijj) defines a surjective function from to Voo which satisfies ( 19T1) . It 
resultsjhat Sf{q{-uj)) = Sf{-q{uj)) for all to G and Sf{q{uj)) G V(R'^) 
with ||^/(gM)|| = ||^/||^_^. 

If / satisfies fl85|) then Sf{q{uj)) and |/(u;)| have an equivalent decay over 
dyadic frequency bands, because their norm is equal over these frequency 
bands. Indeed, for a frequency band A = 2^r of radius proportional to |A| = 
2^, the measure preservation ( lOTl) together with ( l86l) prove that ||?7[A]/|| = 
11/ satisfies 

/ \Sf{q{co))\'du: = ||f/[A]/f = ^ / l/MP mX-'co)\'du: . (92) 

If Conjecture 13.81 is valid then this is true for all / G L^(]R'^). In dimension 
d= 1, q-\C{2^)) = [IIV-II^S^ II II ^2^+^) and |V'(2Ml is^ non-negligible on a 
similar dyadic frequency interval. Hence Sf{q{uj)) and |/(u;)| have equivalent 
energy over dyadic frequency intervals. 

Figure [2](c,d,e) illustrates the convergence of the windowed scattering 
transform Sjf{q{u)) when J increases, for a Gaussian second derivative /. 
Sjf{q{co)) is constant if q{oj) = p is constant and hence if w G q^^{Cj{p)). 
The frequency interval q'^{Cj{p)) has a width fi{Cj{p)) = ||S'j5[p]|p, which 
goes to zero as J goes to 00 as shown by ( 1721) . When J increases, each 
q~^{Cj{p)) is subdivided into smaller intervals q~^{Cj+i{p')) corresponding 
to paths p which are prolongations of p. For each u, the graph color specifies 
the length of the path p = q{u). At low frequencies, q{u!) = is shown as 
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Figure 2: (a): Fourier modulus |/(w)| of a Gaussian second derivative, as a 
function of w G [0,2]. (b,c,d): Piecewise constant graphs of Sjf{q{u})), as a 
function of a; G [0,2]. The color specifies the length of each path q{u): is 
yellow, 1 red, 2 green, 3 blue, 4 violet. The frequency resolution 2"^ increases 
from (b) to (c) to (d), and Sjf{q{uj)) converges to a limit function Sf{q{u)). 



a yellow interval. Paths q{u)) of length 1 to 4 are respectively coded in red, 
green, blue and violet. 

In these numerical examples, the total energy of Sjf{q{u)) on frequency- 
decreasing paths q{u}) is about 10^ times larger than the energy of scatter- 
ing coefficients on all other paths. We thus only compute Sjf{q{uj)) for 
frequency- decreasing paths, with an O(A^logA^) filter bank algorithm de- 
scribed in [13]. It is implemented with the complex cubic spline Battle- 
Lemarie wavelet ip. As expected from flU^ . Sjf{q{u))f has an amplitude 
and a frequency localization which is similar to the Fourier modulus |/(ci;)| 
shown in Figure [2](a). The discontinuities of S{q{u))f along oj are produced 
by the discontinuities of the mapping q{u), as opposed to discontinuities of 
S{q)f relatively to the scattering metric in Poo- 
Figure |3] compares S{q{u))fi and |/j(c(;)| for four functions fi with 1 < 
z < 4. For /i = l[o,i], the first row of Figure [3] shows that = + 
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(b) (c) 

Figure 3: (a): Each row 1 < i < 4 gives an example of function fi{x). (b): 
Graphs of the Fourier modulus \fi{ijj)\, as a function of co. (c): Graphs of the 
scattering Sfi{q{u)), as a function of u. 



has the same decay in u as Sfi{q{u)). The second row corresponds 
to a Gabor function /2(a;) = e*^^'e~^' and the third row shows a small 
scaling fs^x) = /2((1 — s)x) with s = —0.1. The support of /3(c<;) = (1 — 
s)~^/2((l — s)^^uj) is shifted towards higher frequencies relatively to the 
support of /2. A numerical computation gives III/2I — I/3III = C \s\ II/2II with 
C = 13.5. As shown by ([3]), the constant C grows proportionally to the 
center frequency C, of /2. It illustrates the instability of the Fourier modulus 
under the action of diffeomorphisms. On the contrary, the scattering distance 
remains stable. We numerically obtain ||S'/2 — Sf^W = C \s\ II/2II, with C = 
1.5, and this constant does not grow with ^. It illustrates the Lipschitz 
continuity of a scattering relatively to deformations. In the fourth row, /4 
is a sum of two high-frequency Gabor functions, and |/4(w)| includes two 
narrow peaks localized within the support of f^. The wavelet transform has a 
bad frequency localization at such high frequencies, and can not discriminate 
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the two frequency peaks of from f^. However, these two frequency peaks 
create low frequency interferences, which appear in the graph of /4, and 
which are captured by second order scattering coefficients. As a resuh, 5/4 
is very different from Sfs, which illustrates the high frequency resolution of 
a scattering transform obtained through interferences. 

4 Scattering Stationary Processes 

A scattering defines a representation of stationary processes in P(Poo), hav- 
ing different properties than a Fourier power spectrum. The Fourier power 
spectrum depends only on second-order moments. A scattering transform 
incorporates higher-order moments that can discriminate processes having 
same second-order moments. Section 14.21 shows that it is Lipschitz continu- 
ous to random deformations, up to a log term. 

4.1 Expected Scattering 

The properties of a scattering transform in L^(M'^) are extended to stationary 
processes X{x) with finite second-order moments. The role of L^(M'^) norm 
on functions is replaced by the mean square norm E{\X{x)\'^Y^'^ on stationary 
stochastic processes, which does not depend upon x and is thus denoted 
Convolutions as well as a modulus preserve stationarity. If X{x) 
is stationary, it results that U[p]X{x) is also stationary and its expected value 
thus does not depend upon x. 

Definition 4.1 The expected scattering transform of a sationary process X 
is defined for all p = (Ai, Am) G Voo by 

SX{p) = E{U\p\X) = Ei\ \X^^,,\^...\^^,J). 

This definition replaces the normalized integral of the scattering trans- 
form (ITSl) by an expected value. The expected scattering distance between 
two stationary processes X and Y is 

\\SX-SYf= J2 \SX{p)-SY{p)\\ 

p€Voo 

Scattering coefficients depend upon normalized high order moments of 
X. This is shown by decomposing 

\U[p]Xix)\' = Ei\U[p]X\'){l + eix)) . 
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A first-order approximation assumes that |e| ^ 1. Since J ip\{x)dx = 0, 
a nd U[ p]X{x) = ^/\U\p\X{xj^, computing U[p + X]X = \U[p\Xi.^x\ with 
^/lTe ^ 1 + e/2 gives 

+ ^ 2Ei\U[p]X\^y/^ ■ ^^^^ 

Iterating on fl93l) proves that SX{p) = E{U[p]X) for p = (Ai, Am) depends 
on normahzed moments of X of order 2™, successively filtered by the wavelets 
tpXi^ for 1 < k < m. 

The expected scattering transform is estimated by computing a windowed 
scattering transform of a realization X{x): 

Sj[rj]x = {Sj[p]x}p^rj with Sj[p]x = u[p]x^4>,j. 

Since / (t)2j{x)dx = 1, it results that E{Sj[p]X) = E{U[p]X) = SX{p). So 
Sj[Vj]X is an unbiased estimator of {SX{p)}p^pj. 

The autocovariance of a real stationary process X is denoted 

i?X(r) = e(^{X{x) - E{X)) {X{x - r) - E{X))^ . 

Its Fourier transform RX{oi}) is the power spectrum of X. The mean-square 
norm of Sj[Vj]X = {Sj[p]X}p^'Pj is written 

E{\\Sj[Pj]Xr)=J2E{\Sj\p]X\')- 

peVj 

The following proposition proves that Sj[Vj]X and SX are nonexpansive and 
that SX G l^('Poo)- The wavelet ip is assumed to satisfy the Littlewood-Paley 
condition 

Proposition 4.2 // X and Y are finite second-order stationary processes 
then 

E{\\Sj[Vj]X - Sj[Vj]Yf) < Ei\X - Y\') , (94) 
\\SX -SYf <E{\X -Y\^) (95) 

and 

\\SXf < E{\Xf) . (96) 
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Proof: We first show that the wavelet transform WjX = {AjX, {W[\]X)xeAj} 
is unitary over stationary processes. Let us denote 

E{\\WjXf) = E{\AjX\') + J2 E{\W[\]X\') . 

agAj 

Both AjX = X'k(f)2J and Vr[A]X = X-kipx are stationary. Since J 4'2-'{x) dx = 
1 and / tpxix) dx = it results that E{AjX) = E{X) and E{W[X]X) = 0. 
Since the power spectrum of AjX and ly [AjX is respectively RX{u) |0(2"'a;) p 
and RX{uj) \il)\{uo)\^, we get 

E{\AjX\^) = j RX{uj)\^{2^uj)\^ du + E{Xf 

and 

E(|W^[A]Xn = j RX{uj) \i,x{oj)\''dLO. 

Since i?(|Xp) = J RX{uj) du + E{XY, the same proof as in Proposition 12.11 
shows that the wavelet condition ([9]) implies that i?(||iyjX|p) = E{\X\'^). 
The propagator UjX = {AjX, {\W[\]X\)x(z/^j} satisfies 

E{\\UjX - UjYf) < E{\\WjX - WjY\\') = E{\X - Y\^) 

and is thus nonexpansive on stationary processes. We verify as in fl25|) that 

UjU[A^]X = {Sj[A^]X, U[Ay+']X} . 

Since Vj = U^'^qAj , one can compute Sj[Vj]X by iteratively applying the 
nonexpansive operator Uj. The nonexpansive property is derived from 
the fact that Uj is nonexpansive, as in Proposition 12.51 

Let us prove Since SX{p) = E{Sj[p]X) and SY{p) = E{Sj[p]Y) 

\SX{p) - SY{p)f < E{\\Sj[Vj]X - Sj[Vj]Yf) < E{\X - Y\') . 

peVj 

Letting J go to oo proves fl95|) . The last inequality fl96|) is obtained by setting 
Y = 0. D 

Paralleling the scattering norm preservation in L^(M'^), the following the- 
orem proves that Sj[Vj] preserves the mean-square norm of stationary pro- 
cesses. 
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Theorem 4.3 If the wavelet satisfies the admissibility condition ^3^) and if 
X is stationary with < oo then 

E{\\Sj[Vj]Xf)=E{\X\') . (97) 

Proof: The proof of fl97|) is almost identical to the proof of fl3Tl) in The- 
orem I2.6[ if we replace / by X, |/(ci;)p by the power spectrum RX{u) 
and ll/f by E{\X\'^). We proved that E{\\WjXf) = E{\X\'^) so we also 
have -E(||[/jXp) = i?(|Xp). In the derivations of Lemma 12.81 replacing 

fp = U[p]f by Xp = U[p]X, and |/p(a;)P by RXp{u), proves that 

I E{\\U[Pj]Xr) < max(J + 1, 1) Ei\Xf) + Ei\X^^,^') . 

j>0 reG+ 

Since Vj = UmgNA™, if the right hand-side term is finite then 

lim E{\\U[Ay]Xf) = . (98) 

m—^oo 

The same density argument as in the proof of Theorem 12.61 proves that f l98|) 
also holds if E'dXp) < oo because RX{u) is integrable. 

Since E{\\UjXf) = E{\X\^) and UjU[AJ]X = {Sj[AJ]X , U[A'J+^]X}, 
iterating m times on Uj proves as in f l34p that 

m— 1 

^(1^1') = E msj^xf) + Em^im') ■ 

n=0 

When m goes to oo, (^5]) implies ( P7|) . □ 

A windowed scattering Sj[p] = U[p]X ^^(pj averages f/[]5]X over a domain 
whose size is proportional to 2-^. If f/[p]X is ergodic, it thus converges to 
SX{p) = E{U[p]X) when J goes to oo. The windowed transformed scatter- 
ing Sj[Vj]X is said to be a mean-square consistent estimator of SX if its 
total variance over all paths converges to zero: 

hm Ei\\Sj[Vj]X-SjX\\')= hm T Ei\Sj[p]X -SXip)\') = . 

J— J-oo J—^oo ' 

Mean-square convergence implies convergence in probability and hence that 
Sj[Vj]X converges to SX with probability 1. 

For a large class of ergodic processes X, including Gaussian processes, 
mean-square convergence is observed numerically, with E{\Sj[Vj]X—SjX)\'^) < 
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C2~" for C > and a > 0. When J increases, the global variance of 
Sj[Pj]X decreases despite the path subdivision into new paths because each 
modulus reduces the variance by removing random phase variations. The 
variance of Sj[p]X thus decreases when the path length increases, and it is 
concentrated over a small number of frequency-decreasing paths. For a Gaus- 
sian white noise and a moving average Gaussian process of unit variance. 
Figure m shows that log E{\\Sj[Vj]X — SXW^), computed over all frequency- 
decreasing paths, decays linearly as a function J. For the correlated Gaussian 
process, the decay begins for 2"^ > 2^, which is the correlation length of this 
process. Indeed, the averaging by 02-^ effectively reduces the estimator vari- 
ance when 2"^ is bigger than the correlation length. 

Conjecture 4.4 If X is a Gaussian stationary process with ||i?X||i < oo 
then Sj[PjX] is a mean-square consistent estimator of SX. 




1 23456789 10 11 



Figure 4: Decay of log2 E{\\Sj[Pj]X — SXW^) as a function of J for a Gaussian 
white noise X (bottom line) and a moving average Gaussian process (top 
line), along frequency- decreasing paths. 

The following corollary of Theorem 14.31 proves that mean-square consis- 
tency implies an expected scattering energy conservation. 

Corollary 4.5 For an admissible scattering wavelet which satisfies condition 
i[3U\) . Sj[Vj]X is mean-square consistent if and only if 

\\SX\\^ = E{\X\^) , (99) 

and mean-square consistency implies that for all X G A^o 

J2 \SXi\ + p)\' = Ei\X^i;,\') . (100) 
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Proof: It results from Theorem|13]that£;(||^j[Pj]Xf) = E{\X\'^). Since 
E{\\Sj[Vj]Xr) = J2 E{Sj[p]Xf + E{\Sj[Vj]X - EiSj[Vj]X)\'), 

and E{Sj\p\X) = SX{p), we derive that limj^oo E{\\Sj[Vj]X-E{Sj[Vj]X)f) 
Oifandonlyif = E{\X\'^). Moreover, for all A e Aoo, since f/[p]f/[A]X = 

U[X+p]X, applying to U[X\X instead of X proves ffTUUD . □ 

The expected scattering can be represented by a singular scattering spec- 
trum in VoD- Similarly to Section I3l2| we associate to SX{p) = E{U[p]X) a 
function that is piecewise constant in Voo'- 

Vg G Poo , PjX{q) = J2 SXipr ^ff^ . (101) 

The following proposition proves that Pj converges to a singular measure, 
called a scattering power spectrum. 

Proposition 4.6 PjX{q) converges in the sense of distributions to a Radon 
measure in Voo, supported in Voo: 

PX{q) = lim PjX{q) = V SX{pf6{q-p) . (102) 

Proof: For any p G Voo, the Dirac 6{p — q) is defined as a linear form 
satisfying /(g) 5{p — q)dfj,{q) = f{p) for all continuous functions /(g) of 
Voo relatively to the scattering metric. For all J G Z, ii{Cj{p)) = ||5'j[p]5||^, 
p G Cj{p). and Imij^oo l^{Cj{p)) = 0. We thus obtain the following conver- 
gence in the sense of distributions: 

hm i^iMi^ = %-p). 

j^oo wSjIpW 

Letting J go to cx) in ffTOB proves ffT02D . □ 

If Sj[Vj]X is mean-square consistent then fllOOl) implies that the scatter- 
ing spectrum PX{q) is related to the Fourier power spectrum RX{uj) by 



ic{\) 



PX{q)dfi{q) = E{\Xi.iPx\^) = — ! RX {u) \^{\-^ u)\^ du . (103) 

2ti J 
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Let q{u}) be the function of Proposition 13. 9^ which maps the Lebesgue mea- 
sure of M'^^ into the Dirac scattering measure of Voo- It defines a scattering 
power spectrum PX{q{u)) over the half frequency space u G M'^"^. In dimen- 
sion d=l, q-\C{2^)) = [IIV-II^S^' , ll^f 2^+1), so (dm]) imphes 

/■iiv'f 2^+' I r ^ 

/ PX{q{u))du= — / RX{uj) \ij{2^uj)\^du . 

Jup2o 27r J 

Although PX(q(u)) and RX(u) have the same integral over dyadic frequency 
intervals, they have very different distributions within each of these intervals. 
Indeed, (l93l) shows that if p is of length m then E{U[p]X) depends upon 
normalized moments of X of order 2"^. It results that PX{q{u)) depends 
upon arbitrarily high order moments of X where as RX{u) only depends 
upon moments of order 2. Hence, PX{q) can discriminate different stationary 
processes having same Fourier power spectrum and thus same second-order 
moments. 

Figure E] gives the scattering power spectrum of a Gaussian white noise 
X2 and of a Bernoulli process Xi in dimension d = 1, estimated from a 
realization sampled over = 10^ integer points. Both processes have a 
constant Fourier power spectrum RXi{u) = 1 but very different scattering 
spectrum. Their scattering spectrum PXi{q{co)) is estimated by PjXi{q{co)) 
in f llOip at the maximum scale 2-^ = A^. It is a sum of spikes in Figure 
Mjo), which converges to a Radon measure supported in Voo when increasing 
2'^ = N. A Gaussian white noise X2 has a scattering spectrum mostly 
concentrated on paths q{u) = (2-^) of length 1. These scattering coefficients 
appear as large amplitude red spikes at dyadic positions, in the bottom graph 
of Figure El^b). Their amplitude is proportional to SX2{2^y ~ 2^ . Other 
spikes in green, correspond to paths q^u) = (Ai,A2) of length two. They 
have a much smaller amplitude. Scattering coefficients for paths of length 3 
and 4, in blue and violet, are so small that they are not visible. The top of 
Figure[5l^b) shows the scattering spectrum PjXi{q{uj)) of a BernouiUi process 
Xi. It has a maximum amplitude for paths q{u}) of length 1 (in red), but 
longer paths shown in green, blue and violet also produce large scattering 
coefficients, as opposed to a Gaussian white noise scattering. Scattering 
coefficients for paths p of length m depend upon the moments of X up to 
the order 2*". For m > 1, large scattering coefficients indicate a strongly 
non-Gaussian behavior of high order moments. 
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(a) (b) 

Figure 5: (a): Realization of a Bernoulli process Xi{x) at the top and a 
Gaussian white noise X2{x) at the bottom, both having a unit variance, 
(b): Scattering power spectrum PXi{q{u)) of each process, as a function of 
u G [0,7r]. The values of PXi{q{u)) are displayed respectively in red, green, 
blue and violet, for paths q{u}) of length 1, 2, 3 and 4. 



4.2 Random Deformations 

We now show that the scattering transform is nearly Lipschitz continuous to 
the action of random deformations. If r is a random process with ||Vr||oo = 
|Vr(a;)| < 1 then x — t{x) is a random diffeomorphism. If X{x) and r(x) are 
independent stationary processes then the action of this random diffeomor- 
phism on X{x) defines a randomly deformed process LrX{x) = X{x — t{x)) 
which remains stationary. 

The following theorem adapts the result of Theorem 12 . 1 2 1 by proving that 
the scattering distance produced by a random deformation is dominated by 
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a first-order term proportional to E'dlVrH^). Let us denote 

where A™ is the set of paths p = {Xk)k<m of length m with |Afc| < 2'^. 

Theorem 4.7 There exists C such that for all independent stationary pro- 
cessesT andX satisfying ||Vr||oo < 1/2 with probability 1 , if E{\\U[Vj]X\\i) < 
oo then 

E{\\Sj[Pj]LrX-Sj[Pj]Xf)<CE{\\U[Pj]X\UfK{T) (104) 

with 



K{t) = E (^(2-^l|r|U + l|Vr|U(log^^ V 1) + \\HT\\Jj'^ 



(105) 



Over the subset Vj^m of path in Vj of length strictly smaller than m 

Ei\\Sj[Pj,„^]L^X - Sj[Vj,^]Xf) <CmE{\X\^)K{T) . (106) 

Proof: Similarly to the proof of Theorem 12.121 we decompose 

Ei\\Sj[Pj]LrX - Sj{Vj]Xf) < 2Ei\\LrSj[Vj]X-Sj[Vj]Xf) 

+2E{\\[Sj[Pj],L,]Xr). 

Appendix El proves that E{\\[Sj[Vj] , Lr]X\\^) < E{\\U[rj]X\\if B{t) with 



B{r) = C^'i^ ((l|Vr|U(log^^ V 1) + ||ifr|U)'^ 



(107) 



and since 

E{\\USj[Vj]X - Sj[Vj]Xf) < C E{\\U[Vj]X\Uf E{2-'\\r\\l) , (108) 

we get fll04p . The commutator [Sj [Pj] , Lt-] and LrSj[Pj] — Sj[Vj] are ran- 
dom operators since r is a random process. The key argument of the proof 
is provided by the following lemma which relates the expected L^(R°') sup 
norm of a random operator to its norm on stationary processes. This lemma 
is proved in Appendix O 
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Lemma 4.8 Let Kj. be an integral operator with a kernel kr{x,u) which 
depends upon a random process r. If the following two conditions are satisfied 

E{kA., u) K(., u')) = K(.-u, .-W) am j flMl < oo . 

then for any stationary process Y independent of t, E{\Kt-Y{x)\'^) does not 
depend upon x and 

E{\K,Y\^)<E{\\K4')E{\Y\^) , (109) 

where \\Kr\\ is the operator norm in L^(]R'^) for each realization of t. 

This result remains valid when replacing Sj[Pj] by Sj[Vj^m] and U[Pj] 
by U[Vj^m]- With the same argument as in the proof of (!62|) . we verify that 

E{\\U[Vj,^]X\\,)<mE{\X\Y' 

which proves fll06p . □ 

Small stationary deformations of stationary processes result in small mod- 
ifications of the scattering distance, which is important to characterize de- 
formed stationary processes as in image textures [3]. The following corollary 
proves that the expected scattering transform is almost Lipschitz continuous 
in the size of the stochastic deformation gradient Vr, up to a log term. 

Corollary 4.9 There exists C such that for all independent stationary pro- 
cessesr andX satisfying ||Vr||oo < 1/2 with probability 1 , if E{\\U[V oo]X\\i) < 
oo then 

\\SL,X - 5Xf < C E{\\U[Voo]\\i) E{\X\^) K{t) (110) 

with 

1 1 1 1 ^ 

ir(r) = ^|(||Vr|U(logi^^Vl) + ||ifr|U) }• (HI) 

Proof: Ei\\Sj[Vj]L^X~Sj[Vj]Xr) < \\EiSj[Vj]L,X)-EiSj[Vj]X)r, 
so letting J go to oo in (11041) proves (11 101) . □ 

5 Invariance to Actions of Compact Lie Groups 

Invariant scattering are extended to the action of compact Lie Groups G. 
Section ISTT] builds scattering operators in L^(G') which are invariant to the 
action of G. Section [5^ defines a translation and rotation invariant operators 
on L^(R°') by combining a scattering operator on L^(]R'^) and a scattering 
operator on L'^{SO{d)). 
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5.1 Compact Lie Group Scattering 

Let G be a compact Lie group and L^(G') be the space of measurable functions 
/(r) such that = Jq \ f{r)\'^dr < oo, where dr is the Haar measure of G. 
The left action of g E G on f E L^(G) is defined by Lgf{r) = f{g~^r). This 
section introduces a scattering transform on L^(G), which is invariant to the 
action of G. It is obtained with a scattering propagator which cascades the 
modulus of wavelet transforms defined on L^(G). 

The construction of Littlewood-Paley decompositions on compact mani- 
folds and in particular on compact Lie groups was developed by Stein [T9] . 
Different wavelet constructions have been proposed over manifolds [T^ . Geller 
and Pesenson |i8] have built unitary wavelet transforms on compact Lie 
groups, which can be viewed as analogs of unitary wavelet transforms on the 
circle in M^. In place of sinusoids, they use the eigenvectors of the Laplace- 
Beltrami operator of an invariant metric defined on the group. Similarly to 
Meyer wavelets [T^ , these basis elements are regrouped into dyadic subbands 
with appropriate windowing. For any 2"^ < 1, it defines a scaling function 
(/)2i(r) and a family of wavelets {V'2j (^)}-l<j<oo which are in L^(G) The 
wavelet coefficients of / G L^(G') are computed with left convolutions on the 
group G for each X = 2^: 

W[\]fir) = f^^-,ir)= [ fig)ii-,ig-'r)dg (112) 

Jg 

and the scaling function performs an averaging on G: 

ALf{r) = f^Mr)= [ fig)M9''r)dg . (113) 

Jg 

The resulting wavelet transform of / G L^(G') is 

Wl/ = {Al/, (W^[A]/)xeAj with Al = {\ = 2^ : J > -L} . 

At the maximum scale 2^ = 1, since 0i(r) = (J^ dg) = \G\^^ is constant, 
the operator Aq performs an integration on the group: 

Aofir) = \G\-' [ f{g)dg = cst. (114) 
Jg 

Wavelets are constructed to obtain a unitary operator [8] 

WlIW = 11/11 , (115) 
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with ^ ^ 

\\WLfr = \\A,fr+ J2 

The Abehan group G = S0{2) of rotations in is a simple example 
parametrized by an angle in [0,27r]. The space L^(G') is thus equivalent 
to L^[0,27r]. Wavelets in L^(G') are the well-known periodic wavelets in 
L^[0,27r] [H]. They are obtained by periodizing a scaling function 4>2l{x) = 
2-^(f){2-^x) and wavelets ^2i{x) = 2^i){2^x) with {(j),^) G 

02i(a^) = <t>2'-{x ~ Svrm) and ip2:i{x) = V^2j(a^ — 27rm) . (116) 

We suppose that 0(0) = 1 and 0(2A;7r) = for G Z — {0}. The Poisson 
formula implies that (j){x) = 'Ylm&'i^i^ — n) = 2tx. Convolutions flll2p and 
f lllSp on the rotation group are circular convolutions of periodic functions 
in L^[0,27r]. With the Poisson formula, one can prove that the periodic 
wavelet transform Wl is unitary if and only if G L^(M)^ satisfy the 

Littlewood-Paley equahties ([9]). 

For a general compact Lie group G, we define a wavelet modulus operator 
by U[X]f = |Wl[A]/|, and the resulting one-step propagator is 

Since Wl is unitary, we verify as in fl26|) that Ul is nonexpansive and preserves 
the norm in L^(G). 

A scattering operator on L^(G') applies Ul iteratively. Let Vl denote the 
set of all finite paths p = {Ai,...,Am} of length m, where Afc = 2-"= G A^. 
Following Definition 12.2^ a scattering propagator on L^(G') is a path ordered 
product of non-commutative wavelet modulus operators 

U[p]=U[~Xm]...U[~Xi] , 

with [/[0] = Id. 

Following Definition \2A\ a windowed scattering is defined by averaging 
U[p]f through a group convolution with 

SL[p]fir) = ALU[p]f{r) = [ U[p]f{g)M9''r)dg . (117) 

Jg 
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It yields an infinite family of functions SlIVl]/ = {'S'L[p]/}pg-p^, whose norm 
is 

\\SL[VL]fr=J2\\^^lPW with \\SL[P]fr= I \SL[P]f{r)?dr . 

Since U\Pl\ is obtained by cascading the nonexpansive operator Ul-, the 
same proof as in Proposition 12.51 shows that it is nonexpansive: 

V(/,/i)eL2(G)2 , \\S^[V^]f-SL[VL]h\\<\\f-h\\. 

Since U\Pl\ preserves the norm in L^(G'), to also prove as in Theorem 12.61 
that = 11/11, it is necessary to verify that lim^^oo ||f/[Ai]f = 0. 

For the translation group where G = M'^, Theorem 12.61 proves this result by 
imposing a condition on the Fourier transform of the wavelet. The extension 
of this result is not straightforward on L^(G') for general compact Lie groups 
G, but it remains valid for the rotation group G = S0{2) in M?. Indeed, 
periodic wavelets ^ L^(50(2)) = 27r] are obtained by periodizing 
wavelets ip^^ G L^(M) in (I116p . which is equivalent to subsample uniformly 
their Fourier transform. If ip satisfies the admissibility condition of Theorem 
12.61 then by replacing convolutions with circular convolutions in the proof, 
we verify that the periodized wavelets ip^^ define a scattering transform of 
L^[0,27r] which preserves the norm ||>S'i[Pi]/|| = ||/||. 

When 2^ = 1, Aq is the integration operator flll4p on the group, so 
So\p]f{r) does not depend on r. Following Definition 12. 3^ it defines a scatter- 
ing transform which maps any / G L^(G') into a function of the path variable 
p: 

ypeVo , So[P]f=\G\-' I U[p]f{g)dg. (118) 

Jg 

Over a compact Lie group, the scattering transform S'o['Po]/ = {So[P]f}p(^Vg 
is a discrete sequence in P(Po)- The following proposition proves that it is 
invariant to the action Lgf{r) = f{g~^r) of g E G on f E L^(G'). 

Proposition 5.1 For any f G L^(G') and g E G 

So[Vo]Lgf = So[Vo]f. (119) 
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Proof: Since Aq and Vr[A]/ are computed with left convolutions on G, 
they commute with Lg. It results that U[\] and hence Sq[Pl] also commutes 
with Lg. If p G Vq, since So[p]f{r) is constant in r, So[p] Lgf = Lg So[p]f = 
So[p]f, which proves flllQp . □ 

As in the translation case, the Lipchitz continuity of Sl to the action of 
diffeomorphisms relies on the Lipschitz continuity of the wavelet transform 
Wl- The action of a small diffeomorphism on / G L^(G') can be written 
Lrf{r) = /(T(r)~^r) with r(r) G G. The proof of Theorem 12. 121 on Lipschitz 
continuity applies to any compact Lie groups G. The main^ifficulty is to 
prove Lemma I2.14[ which proves the Lipschitz continuity of Wl by comput- 
ing an upper bound of the commutator norm || [1^^, L,-] ||. The proof of this 
lemma can still be carried by applying Cotlar's lemma, but integration by 
parts and the resulting bounds require appropriate hypothesis on the regu- 
larity and decay of ip\- If G = 5*0 (2) then the proof can be directly adapted 
from the proof on the translation group, by replacing convolutions with cir- 
cular convolutions. It proves that Sl is Lipschitz continuous to the action of 
diffeomorphisms on L^(S'0(2)). 

5.2 Combined Translation and Rotation Scattering 

We construct a scattering operator on L^(R°') which is invariant to transla- 
tions and rotations, by combining a translation invariant scattering operator 
on L^(M'^) and a rotation invariant scattering operator on Li'^{SO{d)). 

Let G be a rotation subgroup of the general linear group of M.'^, which 
also includes the reflection —1 defined by —Ix = —x. According to @, 
the wavelet transform in L^(R'^) is defined for any A = 2^r G 2^ x G by 
iy[A]/ = f -k ipx, where ipxi^) = 2'^^ip{2^r~^x). Section [2.11 considers the 
case of a finite group G, which is a subgroup of SO{d) if d is even or which 
is a subgroup of 0{d) if d is odd, while including —1. The extension to a 
compact subgroup potentially equal to SO{d) or 0{d) is straightforward. We 
still denote G~^ the quotient of G by {—1,1}. The wavelet transform of a 
complex valued functions is defined over all A G 2^ x G but it is restricted 
to 2^ X G^ if / is real. All discrete sums on G and G^ are replaced by 
integrals with the Haar measure dr. The group is compact and thus has a 
finite measure |G| = ^Qdr. It results that these integrals behave as finite 
sums in all derivations of this paper. The theorems proved for a finite group 
G remains valid for a compact group G. In the following we concentrate on 
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real valued functions. 

Let Vj be the countable set of all finite paths p = (Ai, A^) with G 
Aj = {A = 2^r : j > — J, r G G~^}. The windowed scattering Sj[Vj]f = 
{Sj[p]f}p(zpj is defined in Definition 12 ■4[ but A j and Vj are not discrete sets 
anymore. The scattering norm is defined by summing the L^(M'^) norms of 
all Sj[p]f for all p = {2^^ri, 2^"^^) G Vj, with the Haar measure: 

oo „ 

\\Sj[Pj]ff = y2 E / \\Sj[2^'ru...,2^-rM'dn...dr^ , 

m=0 ji>-J,...,jm>-J ^ 

which is written 



\sj[Vj]fr= [ \\sj[p]frdp. 



One can verify that Sj[Vj] is nonexpansive as in the case where G is a finite 
group. For an admissible scattering wavelet satisfying ( 130|) . Theorem 12.61 
remains valid and = ||/||. 

The scattering Sj is covariant to rotations. Invariance to rotations in 
G is obtained by applying the scattering operator Sl defined on L^(G) by 
f lll7p . Any p = (Ai, Am) ^Vj with Ai = r2^^ can be written as a rotation 
p = rp of a normalized path p = (Ai, Am), where A^ = r~^Xk and hence 
where Ai = 2^'^ is a scaling without rotation. It results that 

Sj[p]f{x) = Sj[rp]f{x) . 

For each x and p fixed, Sj[rp]f{x) is a function of r which belongs to L^(G). 
We can thus apply the scattering operator Sl[p] to this function of r. The 
result is denoted SL[p]Sj[rp]f{x) for all p eVl- This output can be indexed 
by the original path variable p = rp, and we denote the combined scattering: 

Sl\p] Sj[p]f{x) = Sl[p] Sj[rp]f{x) . (120) 

This combined scattering cascades wavelet transforms and hence convolutions 
along the spatial variable x and along the rotation r, which is factorized from 
each path. In d = 2 dimensions then L^(5'0(2)) = L^[0,27r]. The wavelet 
tranform along rotations is implemented by circular convolutions along the 
rotation angle variable in [0,27r], with the periodic wavelets flll6p . 
A combined scattering transform computes 



SL[VL]Sj[Vj]f = {SL[p]Sj[p]f} 
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Its norm is computed by summing the L^(R'^) norms ||5'2,[p]5'j[p]/|p: 

\\SL[VL]Sj[Vj]fr = j2 [ \\SL[P]Sj[p]fr dp . (121) 

Since Sl\Pl\ and Sj\Pj\ are nonexpansive their cascade is also nonexpansive: 

V(/,/i)eL2(M'^)2 , \\SAVL]Sj[Vj]f-SL[VL]Sj[Vj]h\\<\\f-h\\. 

If Sj is computed with an admissible scattering wavelet then ||>S'j[Pj]/|| = 
||/||. In (i = 2 dimensions, if Sl is computed with periodic wavelets de- 
rived in flll6p from a one-dimensional admissible scattering wavelet, then 
the combined scattering preserves the norm: 

\\SL[VL]Sj[Vj]f\\ = \\Sj[Vj]f\\ = 11/11 . 

By setting L = and letting J go to cxd, the following proposition proves 
that the resulting combined scattering is invariant to translations and rota- 
tions. Such combined scattering representations are used for rotation invari- 
ant classification of image textures [18]. For any {c,g) G M'^ x SO{d), we 
denote Lc,gf{x) = f{g~^{x - c)). 

Proposition 5.2 For all {c,g) eR'^x SO{d) and all f G L'^{M.'^) 

V(p,p) G Po X Poo , So[p]S{p)L,J = So[p]S{p)f . (122) 

Proof: The scattering transform in L^(M'^) is translation invariant and 
covariant to rotations: SLc^gf{p) = Sf{g^^p) for all p G Poo- Since g~^ acts 
as a rotation on the path p, Proposition 15.11 proves that So[p]S f {g^^ p) = 
So[p\Sf{p), which gives (^M)- □ 



A Proof of Lemma 2.8 



The proof of fl371) shows that the scattering energy propagates towards lower 
frequencies. It computes the average arrival log frequency of the scattering 
energy ||[^[p]/|| for paths of length m, and shows that it increases when 
m increases. The arrival log frequency of p = {Xk = 2-''=rfc}fc<m is the log 
frequency index loga |Am| = jm of the last path element. 
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Let us denote = \\U[K'J]ff and = \\Sj[K'J]ff. The average 
arrival log frequency among paths of length m is 

«m = e-l JrrAWWr > -J . (123) 

The following lemma shows that when m increases by 1 then a^, decreases 
by nearly a/2, where a is defined in (130|1 . 

Lemma A.l // ^W\j is satisfied then 

Vm > , - Cra-i < {am + J)em - (Om+l + J)em+i + e^-i - Cm ■ (124) 

We first show that (1124p implies ( l37l) and then prove this lemma. Sum- 
ming over f ll24p gives 

1 

CM V— \ 

- 2^ efc < (ai + J)ei - (a^+i + J)e^+i + Cq - < Cq + (ai + J)ei . (125) 

fc=0 

For m = 1, p = 2^r so Oi ei = X]j>-j X]rgG+ ^ II^P"''^]/!!^- Moreover, 
eo=||/|P, so 

eo + (ai + J)ei = + J2 J2 + ll^[2M/f • 

i>-JrGG+ 

Inserting this in (11250 for m = oo proves (137|) . 

Lemma lA.ll is proved by calculating the evolution of as m increases. 
We consider the advancement of a path p of length m — 1 with two steps 
p + 2-'r + 2V, and denote fp = U[p]f. The average arrival log frequency am 
can be written as the average arrival log frequency of ||f/[p + 2-'r]/||^ over all 
2-^r and all p of length m — 1: 

amem= Yl 5Z 5Z-^'ll^f*^2.rf ■ (126) 

peAy-'' j>-J r€G+ 

After the second step, the average arrival log frequency of ||t/[j9+2-'r+2'r']/|p 
overall p G A™~^, 2^r and 2V is am+i- 

am+iem+l= X] X] X] X] Mll/p*^2^r| *^2'r'r- 

pgA^-i i>- J rGG+ />- J r'GG+ 
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The wavelet transform is unitary and hence for any h G L^(M ) 



l>-Jr'£G+ 



Apphed to each h = fp 'kip2jr in f H^^p this relations, together with 
shows that I = amGm — dm+i Cm+i + Satisfies 



'2'r 



peA"^-^ j>-J r'£G+ l>-JreG+ 

+ U + J) lll/p*^2:'r| *02-HP) • 

A lower bound of / is calculated by dividing the sum on / for / > j and I < j. 
In the j + J — 1 term for I < j, the index / is replaced by j — 1 and the 
convolution with is incorporated in the sum: 



^ s E E E 



J<l<j \reG+ / 



'l|2 



l>j rgG+ 



4m) 



Since wavelets satisfy the unitary property for all real functions / G 
L2(R'^) and all g G Z, 



E E \\f^^2m' + \\f^4>2A\' = \\f*M' 

-q>l>-J reG+ 



128) 



Indeed ([9]) implies that 



(129) 



-q>l>-Jr&G 
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If / is real, then ||/*V'2.,|| = ||/ * V^-2.r||- Multiplying (^W) by and 
integrating in co proves (11281) . Inserting (11281) in (I127p gives 

^ ^ E E E (iii/p*^2.vh02-.+iip 

pi^A"/-'^ j>~J r&G+ 

-^i^-j) (lll/p*^2>r| *02-nP - lll/p*V^2^>K02-'+if)) • 
l>j 

If p > satisfies |p(w)| < |0(2u;)|, then for any / G L^{R'^) and any / G Z, 

II/* 02-^+1 f > ll/*P2'rf with P2lrix)=2'''p{2'r-^X) . 

It results that 

^ > E E E (III ^p^^s^'- 1*^2^'- ir 

-E(^~^') (ll/p*V'2^r|P - |||/p*^2^r| *P2'rf)) • 
l>j 

Applying Lemma [2.71 for h = p2'r ^iiid a frequency 2^7/ proves that 

lll/p* V'2ir| *P2'rll > II /p * V'2Jr * P2'r,2^ II with p2lr,2J (x) = P2lrix) c'^' 

and P2tr,2i{^) = p{2'^r~^u — 2^^^!]). It results that 

^ > E E E (ll/p*V^2^r*P2^r,2Hr 

-^i^-j) (ll/p*V'2J>f - ll/p* V'2ir*P2'r,2^ir)) • 

We shall now rewrite this equation in the Fourier domain. Since fp{x) G 
M, |/p(a;)| = |/p(-w)|, applying Plancherel gives 

^ > ^ E /i/pMI^EE (l^(2-^'^"'^)np(2-V-^a;-.^)r 

pGA^-i r€Gj>~J 

- E(^ - m'^''r-'co)\' (1 - |p(2-V-^a; - 2^-'r^)\') ) dco . 
i>j 
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Inserting defined in ([22]) by 



+00 



k=l 

with k = I — j, gives 



with h{oj) = Y.reG^i'^'^^) \'4'ir~'^(^)? ■ Let us add to / 

■ — 1 ■ vn — 1 



Since p > 0, |p(ci;)| < p(0) = 1 and hence ^(w) < 1. The wavelet unitary 
property ([9]) together with '^{uj) < 1 imphes that 



j<-Jr£G j<-J 

SO 

i+em-i>- E / i/pMI' E 

_^Am-l j= — OO 



If a = inf i<|^|<2 Y.j b{2-^uj) then 6(2--'w) > a for all w ^ 0. If the 
hypothesis (l30l) is satisfied and hence a > then 



i+e^m-i > f E / i/pMr^^ = f E 



= f E IIWir = fe._i. 

peA^-i 

Inserting J = — flm+i Cm+i + J proves that 
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Since Uj preserves the norm, = e^+i + e^, indeed (1251) proves that 
UjU[AJ]f = {f/[A7+V , '^jIA^]/}. Inserting e„ = e„ - e^+i and e„_i = 
f llSOp gives 

^ Cm— 1 ^ (c^jn ~l~ J^^rn iflni+l ~(~ <-^)Cm+l ~(~ Cm— 1 Cm ; 

which finishes the proof of Lemma lA.li 



B Proof of Lemma 12.11 



Lemma 12.111 as well as all other upper bounds on operator norms are com- 
puted with Schur's lemma. For any operator Kf{x) = J f{u)k{x,u)du, 
Schur's lemma proves that 

\k{x,u)\dx<C and j \k{x,u)\ du < C =^ \\K\\ < C , (131) 

where \\K\\ is the L^(M'^) norm of K. 

The operator norm of fcj = LrA j — is computed by applying Schur's 
lemma on its kernel 

kj{x, u) = (p2j{x — t{x) — u) — (l)2j{x — u) . (132) 

A first-order Taylor expansion proves that 

\kj{x,u)\< \ I V02j(a;— M— t r(x)) . r(x) < ||r||oo / \V(j)2j{x—u—tT{x))\dt 
Jo Jo 



so 

\kj{x,u)\du < \\t\\oo J J \V(f)2j{x — u — tT{x))\dudt . (133) 

Since V02j(x) = 2-'^'^-^V0(2-"^x), it results that 

\kjix,u)\du< \\t\\^2-^^-^ j |V0(2-V)|rfM' = 2-^||r|U||V0||i . 

(134) 

Similarly to (1133!) we prove that 

|fcj(x, m)| < ||r||oo / / \V4)2j{x — u — tT{x))\dx dt . 
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The Jacobian of the change of variable v = x — tT{x) is 1 — tVr(x) whose 
determinant is larger than (1 — ||Vr||oo)'^ > S^'^ so 



\kj{x,u)\dx < ||r||oo2" 



\V4>2j{v — u) \ dvdt 



k||oo||V0||i2^ 



Schur's lemma (I13ip applied to this upper bound and (11341) proves the lemma 
result: 

\\L,Aj-Aj\\<2-'+''\\V<j)h Iklloo . 



C Proof of ([69]) 



We prove that 

WL^Ajf -Ajf + T. VAjfW < C 11/11 2-2^ llrll^ (135) 

by applying Schur's lemma (11311) on the kernel of kj = LrAj — Aj + r ■ VAj: 

kj{x, u) = 4>2j{x — r(x) — n) — 02j(a; — u) + V(t>2j{x — u) ■ t{x) . 

Let Hf{x) the Hessian matrix of a function f at x and \Hf{x)\ the sup 
matrix norm of this Hessian matrix. A Taylor expansion gives 



\kj{x,u) 



< T 



tr{x) • H(j)2j{u — X — (1 — t)T{u)) ■ t{x) dt 



\t\\H(t)2j{u -x-{l-t) T{x))\dt . (136) 



Since ct)2j{x) = 2-'^-^(j){2-^x), H(f)2j{x) = 2-'^'^-^-^H(f){2--^x). With a change 
of variable, (11361) gives 



\kj{x,u)\du < \\r\\l^2 



-dJ-2J 



I m{2-\')\du' = 2-''\\T\\l\\H<P\\, 



(137) 

where ||i?0||i = / \H(f){u)\ du is bounded. Indeed all second-order derivatives 
of at M are 0((1 + \u\)~'^~'^). 

The upper bound (I136p also implies that 



\kj{x^ u)\dx < ||r| 



1 

, |t| / \H(j)2j{u — X — {1 — t)T{x))\dudt . 





61 



The Jacobian of the change of variable v = x — (1 — t) r(x) is 1 — (1 — t)Vr(x) 
whose determinant is larger than (1 — ||Vr||oo)'^ so 

j\kj{x,u)\dx < ||r||^(l - ||Vr||oo)"'^ y j \H^2-j{v - u)\ dv dt 

= 2-''\\r\\l\\H<P\U2''. (138) 
The upper bounds fll37p and fll38p with Schur's lemma fll3ip proves fll35p . 



D Proof of Lemma 12.13 



This appendix proves that for any operator L and any / G L^(]R^) 

oo 

\\[Sj[Vj],L]f\\ < \\[Uj,L]\\ \\U[Vj]fh = \\[Uj, L]\\ 5^||f/[A:}]/||. (139) 

n=0 

If A and B are two operators, we denote {A, B} the operator defined by 
{A,B}f = {Af,Bf}. We introduce a wavelet modulus operator without 
averaging: 

Vjf = {\W[X]f\ = \f^iJx\}xeAj with Aj = {2h : j>-J,reG^}, 

(140) 

and Uj = {Aj , Vj}. The propagator Vj creates all paths VjU[Aj]f = 
f/[A}+^]/ for any n > 0. Since f/[A|}] = Id, it results that Vy = U[A^]. 
Let Vj^m be the subset of Vj of paths p of length smaller than m. To verify 
(I139p . we shall prove that 

m 

[Sj[Vj,m], L]=J2 Km-nVj^ , (141) 
n=0 

where = {[Aj,L] , Sj[Vj,n~i][Vj, L]} satisfies 

ll^nll < (142) 

Since Vyf = ?7[A}]/, it imphes that for any / G V(M.'^) 

m m—1 

\\[Sj[Vj,miL]f\\ <Y,\\K^-n\\ wv^fw < \\[Uj,L]\\ J2 mmi , 

n=0 n=0 
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and letting m tend to oo proves fll39p . 

Property fll4ip is proved by first showing that 



Sj[Pj,^]L = {LAj , Sj[Vj^m-i]LVj} + , (143) 

where Km = {[Aj, L] , Sj[Vj,m-i] [Vj,L]}. Indeed, since = U[A% we 
have AjVy = Sj[A}] and Vj,^ = U^Jo^A} yields Sj[Pj,m\ = {AjVj"}o<n<m. 
It results that 

Sj[Pj^rn]L = {AjVjL}Q<n<m 

= {LAj+[Aj,L],AjVj^-'LVj + AjVj^-'[Vj,L]},<n<m 
= {LAj , Sj[Vj,m-i]LVj} + {[Aj, L] , Sj[Vj,m-i] [Vj, L]} 

= {LAj , Sj[Vj,m-l]LVj} + Km , 

which proves (11431) . 

A substitution of Sj[Pj^m-i]L in fll43p by the expression derived by this 
same formula gives 

Sj[Pj,m]L = {LAj , LAjVj , Sj[Vj,m-2]LVj} + Km-lVj + Km • 

With m substitions, we obtain 

m m 

Sj['Pj^m]L = {LAjVj}o<n<m + ^ Km-nVj = LSj[Pj^m] + ^ Km-nVj 

n=0 n=0 

which proves (11410 . 

Let us now prove (11420 on Km = {[Aj,L], Sj[Vj^m-i] [Vj,L]}- Since 
Sj[Vj] is nonexpansive, its restriction Sj[Pj^m] is also nonexpansive. Given 
that Uj = {Aj , Vj} we get 

WKmff = \\[Aj,L]fr + \\Sj[Vj,m-i][Vj,L]fr 

< \\[Aj,L]fr+\\[Vj,L]fr = \\[Uj,L]fr<\\[Uj,L]r^^^^^' 

which proves (I142p . 



E Proof of Lemma 12.14 



This section computes an upper bound of by considering 

oo 

[Wj,L,]*[Wj,L,]= J2 E m2^r],L,]*[W[2^r],L,] + [Aj,L,Y[Aj,L,]. 

r&G+ j=-J+l 
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Since \\[Wj,L;\\\ = \\[Wj, L^]* [Wj, L^]\\'/^ 



II 1/2 

ii[w^.,^.]ii< E II E m2HLrnw[2^r],u] +\\[Aj,ur[Aj,u]r/'. 

reG+ j=-J+l 

(144) 

To prove the upper bound (!6T|) of Lemma 12.141 we compute an upper bound 
for each term on the right under the integral and the last term, which is done 
by the following lemma. 

Lemma E.l Suppose that h{x), as well as all its first and second-order 



derivatives have a decay in 0((1 + |x|) ^). Let Zjf = f -k hj with h 
2*7i(2-'x). There exists C > such that if ||Vr||oo < then 

||[Z,-,L,]|| <C||Vr|U (145) 

and if J h{x) dx = then 

1/2 „. / /. WAi 



J2 [Zj,LrnZj,L,]\\ <c(max(logL^, l) ||Vr|U + ||i/r|U) • 

(146) 



j=-oo 



The inequality (1146P clearly remains valid if the summation is limited to 
—J instead of — oo since [Zj, LT-]*[Zj, L^.] is a positive operator. Inserting in 
01441) both 01451) with h = (f) and 0146p with h{x) = ilj{r~^x) for each r G G+, 
and replacing — oo by — J proves the upper bound fl^ of Lemma 1!^. 141 

To prove Lemma fE.lt we factorize 

[Zj, Lr] = Kj Lr with Kj = Zj — Zj L'^^ . 

Observe that 

||[Z,-,L,]*[Z,-,L,]f/2 = \\LlK*KjLr\\^'^ < \\Lr\\ \\K* K.^'^ (147) 
and that 



+ 00 



1/2 



+ 00 



E [Z,.UnZ,,U] <\\LM J2 K*K,f 



/2 



(148) 



j=-oo 



j=-oo 
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with < (l-||Vr||oo)~'^. Since L;7(x) = f{^ix)) with ^(a; - r(x)) = x, 
the kernel of Kj = Zj — Lj. Zj is 

kj{x,u) = hj{x — u) — hj{x — t{x) — u + t{u)) det(l — Vr('u)) . (149) 

The lemma is proved by computing upper bounds of \\Kj \\ and || Xlj=^oo 
The sum over j is divided in three parts 

+00 — 7— 1 —1 00 

II E ^;^.-ii'''<ii E^;^.-ii'''+iiE^;^oii'''+iiE^;^^.-ii'''- 

j=-oo j=-oo j=-l j=0 

(150) 

We shall first prove that 



'r,|^^ 



(151) 



II E ^;i^,ir/'<c(||Vr|U + 2-^||Ar|U + 2-^/2||Ar||V2||V7 

j=-oo 

Then we verify that \\Kj\\ < C ||Vt||oo and hence that 

-1 

II E K; K.W'/' < 7 lli^.ll < C7 II Vr|U . (152) 

i=-7 

The last term carries the singular part and we prove that 

00 

||E^7^.ir^' <C(||Vr|U + ||i/r|U) . (153) 

j=0 

Choosing 7 = max (log [[^^[H , 1) yields 



||V 

E K;K,r/' < C (max(log , l) ||VrlU + ||i/r|U) 



j=-oo 



Inserting this result in fll48p will prove the second lemma result fll46p . In 
the proof, C is a generic constant which depends only on h but which evolves 
along the calculations. 

The proof of fll5ip is done by decomposing Kj = Kj^i + Kj 2, with a first 
kernel 

kj^i{x,u) = a{u) hj{x — u) with a(u) = (1 — det(l — Vr(M))) , (154) 
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and a second kernel 
kj^2{x, u) = det{l — 'VT{u))(^hj{x — u) — hj{x — t{x) — u + r{u))^ . (155) 

This kernel has a similar form as the kernel (11321) in Appendix [B] by r(x) is 
replaced here by r(x) — t{u). The same proof shows that 

\\K,4 <C2^lAr|U . (156) 
Taking advantage of this decay, to prove (I15ip . we decompose 

II E K;K,\\y^ < II f; ir;,Aiir/^ 

j=—oo J=— oo 

+ f;(||ir,,2||+2V2||i^^.,||l/2||i^^.^||l/2) 

j=-oo 

and verify that 



||i^i,i|| < C||Vr|U and || J] Kl,Kj,i\\'/^ < C||Vr||oo • (157) 

j=-oo 

The kernel of the self-adjoint operator K*-^^Kj^i is: 

kj{y, = J k^iix, y) kj,i{x, z) dx = a{y) a{z) hj -k hj{z - y) , 
with hj{u) = h*{—u). It results that the kernel of = ^j<o is: 



jyX) 



Hy,z) = ^kj{y,z) = a{y)a{z)e{z-y) with e{x) =^hj-kh 
j<o j<o 

Applying Young's inequality to \\Kf\\ gives 

||i^'||<sup |a(«)ni^^||i . 

Since e{u) = Ei<olM2^^w)P and h{0) = J h{x) dx = and /i is both 
regular with a polynomial decay, we verify that ||6'||i < oo. Moreover, since 
(1 - det(l - Vr(u))) > (1 - ||Vr||oo)'' we have sup„ |a(n)| < d \\Vt\\^ which 



66 



proves that < C||Vr||oo- Since < \\K\\ we get the same 

inequahty for which proves the two upper bounds of fll57p . 

The last sum Yl'^o ^j-^j carries the singular part of the operator, which 
is isolated and evaluated separately by decomposing Kj = Kj^i + Kj^2, with 
a first kernel 

= hj{x - u) - hj{{l - Vt{u)){x - u)) det(l - Vr(n)) (158) 
satisfying Kj il = J kj^i{x,u) du = if J h{x) dx = 0. The second kernel is 

kj^2ix,u) = det(l — Vr('u)) — Vr('u))(x — m)) — — r(a;)— 'U + r('u))j . 

(159) 

The sum J2j>o has a singular kernel along its diagonal, and its norm 

is evaluated separately with the upper bound 

oo oo oo 

j=0 j=0 j=0 

(160) 

We will prove that 

< C||Vr|U (161) 

and 

||J^,,2|| < C min(2-ii/r||oo , ||Vr|U) • (162) 

It implies that \\Kj\\ < C ||Vr||oo- Inserting this inequality in f ll47p yields 
the first lemma result fll45p and it proves (11520 . Equations (I16ip and (11620 
also prove that 

oo 

J2{\\K,M\ + 2'/^K,M\'^'\\K,,^r/') < C(|| VrlU + \\HrU) . (163) 

j=0 

If J h{x) dx = then thanks to the vanishing integrals of kj^i we will prove 
that 

oo 

\\J2kI,K,,i\\'^' < C (||Vr|U + \\Ht\\^) . (164) 

j=0 

Inserting ffTB5D and ffTMD in ffM?]) proves ffT^ . 

Let us now first prove the upper bound (I162p on Kj^2- The kernel of Kj 2 

is 

kj^2{x,u) = det{l — 'Vr{u))(^hj{{l — 'VT{u)){x — u)) — hj{x — r{x)—u + T{u))^ . 
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A Taylor expansion of hj together with a Taylor expansion of r(x) gives 

r(x) — t{u) = Vt{u){x — u) + a{u, x — u) (165) 

with ^ 

a{u,z)= j tzHT{u + {l-t)z)zdt , (166) 
Jo 

so 

kj,2{x,u) = -det(l - Vr(M)) (167) 
Vhj(^{l -tVT{u)){x -u) + {l-t) {t{u) - t{x))^ a{u,x- u) dt . 

For j > 0, we prove that ||-ft'j,2|| decays like . Observe that | det(l — 
Vt{u))\ < 2'^. Since Vhj{u) = 2^+'^^V/i(2%), the change of variable x' = 
2^{x — u) in (11671) gives 

j \kj^2{x,u)\dx < y j^yh[{l-tVT{u))x' 

+ (1 - t) 2^{t{u) - r(2-^x' + m))) Ta{u, 2~^x') dt dx' . 

For any < t < 1 

{l-tVT{u))x' + {l-t)2^{T{2~^x' + u)-t{u)) > \x'\ (l-||Vr||oo) > \x'\/2 . 
Equation (I166p also implies that 



\x 



i\2 



|2-^'a(M,2-V)| =2-^\ / tx' HT{u + {l-t)2-^x)x' dt\ < 2--'' ||i7r||oo • 

(168) 

Since |V/i('u)| <C(1 + |m|) ^, with the change of variable x = x'/2 we get 



\kj^2ix,u)\dx<C2~^\\Hr\\^. 



(169) 



For j < 0, we use a maximum error bound on the remainder a of the 
Taylor approximation fll65p : 

|2^'a(M,2-V)| < 2||Vr|U|x'| , 
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which proves that / \kj^2{x,u)\dx < C ||Vr||oo and hence that 

J \k,^2ix,u)\dx <C mm{2-^\Hr\\^, ||Vr|U) • (170) 

Similarly, we compute J \ kj^2{x,u)\du with the change of variable u' = 
2^{x — u) which leads to the same bound (11701) . Schur's lemma gives: 

||K,,2|| < C min(2-iifr||oo , || Vr|U) (171) 

which finishes the proof of fll62p . 

Let us now compute the upper bound (11611) on Kj^i. Its kernel kj^i in 
(11581) can be written kj^i{x, u) = 2'^^ g{u, 2^{x — u)) with 

g{u, v) = h{v) - h{{l - Vt{u))v) det(l - Vt{u)). (172) 

A first-order Taylor decomposition of h gives 

g{u,v) = (l-det(l- Vr(M)))/i((l- VT(n)t;) (173) 

+ [ Vh{{l -t)v + t{l -Vr{u))v) ■VT{u)vdt. 
Jo 

Since det(l-Vr(M)) > (1- 1| Vr||oo)'^ we get (l-det(l- Vr(M))) < ci||Vr||oo. 
Moreover ||Vr||oo < 1/2 and h{x) as well as its partial derivatives have a 
decay which is 0((1 + |x|)~'^~^), so 

\g{u,v)\<C\\VT\\^(^l + \v\y'~\ (174) 
so %i(x,m) = 0(2'^^ \\Vt\\oo{1 + 2^x -u\)-'^-'^y Since 

J \kj^i{x,u)\du = 0{\\Vt\\oo) and J \kj ^i{x, u)\ dx = 0{\\Vt\\ 00) , 

Schur's lemma (I13ip proves that H-ft'j,!!! = 0(||Vr||oo) and hence (I16ip . 

Let us now prove (11641) when j h{x) dx = 0. The kernel of the self-adjoint 
operator Qj = K*^^Kj^i is: 



kj{.y,z) = I k*-^{x,y)kj^i{x,z)dx = j 2'^'^^g*{y,2^{x -y))g{z,2^{x - z))dx 
2^^g*{y, x' + 2^{z - y)) g{z, x')) dx' . (175) 
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The singular kernel k = J2j of Qj almost satisfies the hypotheses of the 
T(l) theorem of David, Journe and Semmes [5] but not quite because it does 
not satisfy the decay condition \k{y, z) — k{y, z')\ < C\z' — z\°' \z — y\~'^~"' for 
some a > 0. We bound this operator with Cotlar's lemma [20] which proves 
that if Qj satisfies 

Vj, /, \\Q*Qi\\ < and \\Q,Q;\\ < - , (176) 

then 

Since Qj is self-adjoint, it is sufficient to bound HQiQ^H. The kernel of 
Qi Qj is computed from the kernel kj of Qj 

ki,j{y, ^) = J ^) u) du. (178) 

An upper bound of \\Qi Qj\\ is obtained with Schur's lemma (113 ip applied to 
ki j. Inserting fll75p in fll78p gives 

\ki,j{y,z)\dy = J I J g{u,x)g{u,x')2'^^g*{y,x + 2\u-y)) 

2^^' g*{z, x' + 2^{u - z)) dx dx' du\ dy .(179) 

The parameters j and / have symmetrical roles and we can thus suppose that 
3>l- 

Since / h{x) fix = it results from (11721) that / g{u,v) dv = for all u. 
For V = {vn)n<d, ouc cau thus write g{u,v) = ^^g"^^-* and (I174p implies that 

\9iu,v)\ < C ||Vr|U (l + |^|)~'~' ■ (180) 

Let us make an integration by parts along the variable ui in (11790 . Since 
all first and second-order derivatives of h{x) have a decay which is 0((1 + 
\x\)~'^~'^), we derive from (I172p that for any u = {un)n<d £ I^'^ and v = 

iVn)n<d e 



dg{u,v) 
dui 



< C \\Ht\\^ ( 1 + (1 - II Vrlloo) \ (181) 
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and from (11731) 

dg{u,v) 



dvi 



<C||Vr|L l + kl(l-||Vr| 



-d-l 



(182) 



In the integration by part, integrating 2'^^g{z, x' + 2^{u — z)) brings out a term 
proportional to and differentiating g{u,x) g{u,x')2'^''g{y,x + 2\u — y)) 
brings out a term bounded by 2'. An upper bound of (11791) is obtained by 
inserting (I174II180[ I181|182l) , which prove that there exists C such that 



< C2 2'-^'(||Vr|L + ||//r||oo)^ 



The same calculation proves the same bound on J \ kij{y,z) \ dz so Schur's 
lemma (113ip implies that 

\\QiQj\\<C^'2}-^{\\VT\U + \\Hr\\^f . 

Applying Cotlar's lemma ([176]) with /3(j) = C 2-l-'l/2 (|| Vr||oo + \\HT\\^f 



proves that 

+ 00 

II ^ i^,iir,,i|| = ||5^g,||<C(||Vr|U + ||i^r| 



OO ) 1 



(183) 



3=-oo 

which implies (11641) . 



F Proof of Lemma 13.6 



It results from (l85l) that there exists ej with limj_>.oo ej = such that 



sup 

peVj-n 



SjW- 



\Sj[p]f\\ 
\Sj[p]6\\ 



Sj[p]S 



and Epgn5 



\Sj[p]fr<ej\\fr/8. Since \\Sj[Vj] 



we get 



peVj 



\Sj[p]f\\ 

ISjIpM 
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Sj[p]5 



<ej 



(184) 



The set of all extensions of a p G Vj into Pj+i is defined in (15^ . It can 
be rewritten Vj_^_-^ = Vj+i fl Cj{p), and (HOl) proves that 

\\sj[p]f - Sj[p]hr > w^jMf - sj^iIpW- 

p'eVj+inCj{p) 
Iterating k times on this result yields 



\\Sj[p]f - Sj[p]hf > Yl \\Sj+k[pV - Sj+k[p 

Applying it to / and h = fip6 with /ip = ||5'j[p]/||/||5'j[p]5|| gives 



Sj[p]f - f^pSj[p]5 



p'ePj+fenCj+fc(p) 



Sj+k[p']f - fJ-p Sj+k[p']S 



Summing over p gVj and applying fll84p proves that 



E E 

peVjp'eVj+knCj+k{p) 



and hence 

E E 

pePjp'GPj+fcnCj+fc(p) 



\SMp']f\\ \\Sj[p]f\\ 



\Sj+k[p'n \\Sj[p]5\\ 



\Sj+k[pW<^j 



If q E Cj+,ip') then Sj+,iq) = \\Sj+k[p']f\\/\\Sj+k[p']5\\. But p' E Cj{p) 
so q E Cj{p) and hence Sj{q) = \\Sj[p]f\\/\\Sj[p]6\\. Finally \\Sj+k[pW = 
fj.{Cj+k{p')) so the sum can be rewritten as a path integral 



\Sj+kf{q)-Sjf{q)\'dfi{q)<ej 



■po 



which proves that {5'j/}jgN is a Cauchy sequence in L'^(Voo,dfi). 



G Proof of Lemma 14.8 



This appendix proves that 

E{\K,Xf)<E{\\K4')E{\X\') 



;i85) 
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as well as a generalization to sequence of operators, at the end of the ap- 
pendix. The lemma result is proved by restricting X to a finite hypercube 
It = {{xi, ...,Xd) G M"^ : \/i < d , \xi\ < T}, whose indicator function 1/^, 
defines a finite energy process Xt{x) = X{x) l/j,(a;). We shall verify that 
E{\KrX{x)\^) does not depend upon x and that 

E{\K^X{x)\') = hm ^^''^-Jj"'^ . (186) 

Let first show how this result implies f ll85p . The L^(R'^) operator norm 
definition implies 

iKrXrf = j \KrXT{x)\^dx < \\Krf J \XT{x)\^dx . 

Since X and r are independent processes 

EHIK^XtW') < E{\\K4') E{\X\') {2TY • 

Applying f ll86p thus proves the lemma result f ll85p . 
To prove fll86p . we first compute 

E{\KrX{x)\^) = E(^j j kr{x,u)k*{x,u)X{u)X*{u)dudu'^ . 

Since X is stationary E{X{u) X*{u')) = Ax{u — u'), and the lemma hypoth- 
esis supposes that E{kr{x,u) k*{x,u')) = kr{x — u,x — u'). Since X and r 
are independent, the change of variable v = x — u and v' = x — u' gives 



E{\KrX{x)Y) = I I kr{x - u,x - u') Ax{u - u') dudu' 

kriv, v') Ax{v — v') dv dv' , (187) 
which proves that E{\Kt-X{x)\'^) does not depend upon x. Similarly 

E{\KrXTix)\^) = j jkr{v,v')Ax{v-v')li^{v-x)li^{v'~x)dvdv\ (188) 
and integrating along x gives 
{2T)-'^ EiWK^XrW'^) = j j kr{v,v') Ax{v ~v') {I- pt{v -v')) dv dv' , (189) 
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with 

and hence 

d 

< Pt{v) < (2T)-i ^\vi\<d (2T)->| . (190) 

i=l 

Inserting fll87p in f ll89p proves that 

{2T)-'^E{\\K^XTf) = E{\K^X{x)\^)- [ [ Kiv,v') Axiv-v') priv-v') dv dv' . 



(191) 

Since JJ\kr{v,v')\ \v-v'\dvdv' < oo and Ax{v-v') < Ax{0) = ^(jXp), it 
results from ffT9B and ffT90|) that 

hm (2T)-'^E(||ir,X^f ) = E{\K,X{x)\') , 

T->oo 

which proves (11861) . 

Lemma is extended to sequences of operators K^. = {-ft'r.njne/ with 
kernels {kr,n}nei, as follow. Let us denote 

\\KrXf = J2\Kr,nX\' aud \\K J f = \\Kr,nf \\' ■ (192) 
If each average bilinear kernel is stationary 

E{kr,n{x, u) k* „(x, u')) = kr,n{x — U,X — u') (193) 

and 

'^^kT.^n{v,v')\\v - v'\dv dv' < oo , (194) 
then 

E{\\K^X\\')<E{\\K4^)E{\X\') . (195) 

The proof of this extension follows the same derivations as the proof of (I185P 
for a single operator. It just requires to replace the L^(R'^) norm by the 
norm Ylm&i II /^P space of finite energy sequences {fn\n<^i of L^(R'^) 

functions and the sup operator norms in L^(M'^) by sup operator norms on 
sequence of L^(M'^) functions. 
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H Proof of Theorem 14.7 



This appendix proves that E{\\[Sj[Vj],Lr\Xf) < E{\\U[Vj]X\\if B{t) 
with 

B{t) =Ce[ (||Vr|U(log^^ V 1) + ||i/r|u)'}, (196) 

and E{\\U[Vj]Xh) = Et=o(E,eAj Emp]X\')Y\ 

For this purpose, we shall first prove that if for any stationary process X 

E{\\[Wj,U]Xf)<Bir)E{\X\') (197) 

where 

Ei\\[Wj , L^]Xr) = E{\[Aj , L^]X\') + J2 Ei\[W[\],L^]X\') 

agAj 

then 

E{\\[Sj[Pj] , L.]X|n < B{t) E{\\U[Pj]X\\,y . (198) 

Since a modulus operator is nonexpansive and commutes with L^-, with 
the same argument as in the proof of fl57|) . we derive from (11971) that 

E{\\ [Uj, U]Xf) < B{r) E{\X\^) . (199) 

The proof of Proposition 14.21 also shows that Uj is nonexpansive for the 
mean square norm on processes. Since Sj\Pj] is obtained by iterating on Uj 
it results that 

E{\\[Sj[Vj], L,]Xf) < B{t) E{\\U[Vj]X\\,f . 

The proof of this inequality follows the same derivations as in Appendix [Dl 
for L = L., by replacing / by X, by E{\X\''), \\U[p\ff by E{\U[p]X\''), 
and the L^(]R'^) sup operator norm ||[t/j,L]|| by B{t) which satisfies (11991) 
for all X. 

The proof of (I196p is ended by verifying that 

E{\\[Wj,Lr]X\\^)<E{C\T))E{\X\'') (200) 
and hence B{t) = E{C^{t)) with 



C(r) = C (ll Vr|U(log V 1) + \\Ht\ 
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The inequality fl200p is derived from Lemma 12.141 which proves that the 
L^(R'^) operator norm of the commutator [W^j,//,-] satisfies 

\\[Wj,U]\\<C{r) , (201) 

and by applying to K-^ = [Wj,L^] = {[Aj,L^] , [W[\] , Lr]}xeAj the exten- 
sion f ll95p of Lemma I4.8[ This extension proves that if the kernels of the 
wavelet commutator satisfy the conditions f ll93p and f ll94p then 

E{\\[Wj,L^]Xr)<E{\\[W,Lr]r)E{\X\'). 

Together with fl20T|) it proves fl200|) . 

To finish the proof we verify that the wavelet commutator kernels satisfy 
(II13D and ffTMj) . If Zjf{x) = fi^hj{x) with hj{x) = 2'^^h{2^x) then the kernel 
of the integral commutator operator [Zj, L^] = ZjL^- — L-^Zj is 

krj{x,u) = hj{x — u — t{x)) (202) 
-hj{x -u- t{u + t{P{u)))) I det(l - Vr(u + t{P{u))))\-^ 

where (3 is defined by (3{x) = x + r(/3(x)). The kernel of [Aj, L^] is k^^j with 
h = (f), and the kernel of [iy[A],L^] for A = 2^r is krj with h{x) = ip{r~^x). 
Since r and Vr are jointly stationary, the joint probability distribution of 
their values at x and u + t{I3{u)) only depends upon x — u. It results that 
E{krj{x,u) kr,j{x,u')) = kr,j{x — u,x — u') which proves the kernel station- 
arity fll93p for wavelet commutators. 

The second kernel hypothesis ( I194p is proved by showing that if \h[x)\ = 
0((1 + \x\)-'^-^) then 

I krj{v,v') \ \v — v'\ dvdv' < oo . 
j>-J 

Since krj{v,v') = E{kr. ')), it is sufficient to prove that 

there exists C such that for all x, with probability 1 

I = \K,ji^^^-^)\\Kji^^^-^')\\^-^'\dvdv' <C . (203) 

Since hj{x) = 2'^^h{2^x) and u + t{(3{u)) = /3(m), it resuhs from f l2U2]) that 
kr,j{x, X — 2^^w) = 2'^^ krj{x, X — w) with 

kr,j{x,x-w) = h{w-2^T{x)) (204) 
-h{vu - 2^t{I3{x - 2-^vu))) I det(l - Vr(/3(x - 2~^w)))\-^ . 
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The change of variable w = 2^v and w' = 2 ^v' in (12031) shows that / = 
E,>- J 2-^7, with 

Ij = J J Ik-rjix, X — w)\\kr,j{x, X — w')\\w w'\ dw dw' . 

Since \h{w)\ = 0((1 + \w\y^~'^) and ||Vr||oo < 1/2 with probabihty 1, by 
computing separately the integrals of each of the four terms of the product 
\krj{x,x+w)\ \krj{x,x+w')\ \w—w'\, with change of variables, y = w+2^t{x) 
and z = w + 2^t{I3{x + 2~^w)), we verify that there exists C such that Ij < C 
and hence that I = Y.j>-j 2"^^i < 2-^+^C' with probability 1. It proves fl203|l 
and hence the second kernel hypothesis (11941) . 
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